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RECOMBINANT ADENOVIRUS AND ADENO— ASSOCIATED VIRUS, 
CELL LINES, AND METHODS OF PRODUCTION AND USE THEREOF 

This invention was supported by the National 
Institute of Health Grant Nos. HD32649-01, DK47757 and 
5 DK49136 . The United States government has rights in this 

invention. 

Field of the Invention 

The present invention relates generally to the fi eld 

of somatic gene therapy, and specifically to methods and 
10 compositions useful in the treatment of genetic 
disorders . 

Background of the invention 

Adenoviruses are eukaryotic DNA viruses that can be 
modified to efficiently deliver a therapeutic or reporter 
15 transgene to a variety of cell types [see, e.g., M. S. 
Horwitz et al, "Adenoviridae and Their Replication", 
Virology , second edition, pp. 1712, ed. B. N. Fields et 
al. Raven Press Ltd., New York (1990)]. Recombinant 
adenoviruses (rAds) are capable of providing extremely 
20 high levels of transgene delivery to virtually all cell 
types, regardless of the mitotic state. The efficacy of 
this system in delivering a therapeutic transgene in vivo 
that complements a genetic imbalance has been 
demonstrated in animal models of various disorders [K. F. 
25 Kozarsky et al. Somatic Cell Mol. Genet. . 19:449-458 

(1993) ("Kozarsky et al I"); K. F. Kozarsky et al, J. 

Biol. Chem. . 269 :13695-13702 (1994) ("Kozarsky et al II) 

and others]. The use of recombinant adenoviruses in the 
transduction of genes into hepatocytes in vivo has 
30 previously been demonstrated in rodents and rabbits [see, 
e.g., Kozarsky II, cited above, and s. ishibashi et al, 

J. Clin. Invest. . 92:883-893 (1993)]. 

The first-generation recombinant, replication- 
deficient adenoviruses which have been developed for gene 
35 therapy contain deletions of the entire Ela and part of 
the Elb regions. This replication-defective virus is 
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grown on an adenovirus— transformed , complementation human 
embryonic kidney cell line containing a functional 
adenovirus Ela gene which provides a transacting Ela 
protein, the 293 cell [ATCC CRL1573] . El-deleted viruses 
5 are capable of replicating and producing infectious virus 
in the 293 cells, which provide Ela and Elb region gene 

products in-trans. — The— resulting— virus— is-capable- of 

infecting many cell types and can express the introduced 
gene (providing it carries its own promoter) , but cannot 
10 replicate in a cell that does not carry the El region DNA 
tinless the cell is infected at a very high multiplicity 
of infection. 

Adeno-associated virus (AAV) is an integrating human 
DNA parvovirus which has been proposed for use as a gene 
15 delivery vehicle for somatic gene therapy [B. J. Carter, 
in "Handbook of Parvoviruses", ed. , P. Tijsser, CRC 
Press, pp. 155-168 (1990)]. This small non-enveloped 
virus contains a 4.6 kb single stranded (ss) DNA genome 
that encodes sets of regulatory and capsid genes called 
20 rep and cap. Rep polypeptides (rep78, rep68, rep62 and 
rep40) are involved in replication, rescue and 
integration of the AAV genome. The cap proteins (VP1, 

VP 2 and VP3 ) form the virion capsid. Flanking the rep 
and cap open reading frames at the 5 ' and 3 ' ends are 145 
25 bp inverted terminal repeats (ITRs) , the first 125 bp of 
which are capable of forming Y- or T-shaped duplex 
structures. 

Recombinant forms of AAV (rAAV) have been developed 
as vectors by replacing all viral open reading frames 
30 with a therapeutic minigene, while retaining the 

necessary cis elements contained in the ITRs. [See, 
e.g. , US Patent Nos. 4,797,368; 5,153,414; 5,139,941; 
5,252,479; and 5,354,678; and International Publication 
Nos. WO 91/18088 published November 28, 1991; WO 93/24641 
35 published December 9, 1993 and W094/13788 published June 
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23, 1994]. However, progress towards establishing AAV as 
a transducing vehicle for gene therapy has been slow for 
a variety of reasons. Por example, the integrated 
provirus preferentially targets specific sites in 
chromosome 19. Additionally, difficulties surround large- 
scale production of replication defective recombinants. 

— The^cel Is employed to-pr oduce rAAV must alsobeinf ected 
with adenovirus or herpesvirus to provide the necessary 
helper functions, thereby producing problems in purifying 
recombinant AAV (rAAV) from contaminating virus in 
culture. Practical experience with purified recombinant 
AAV as a gene therapy vector has been disappointing, 
because the more purified the AAV is from co-infection 
with its helper virus in culture, the lower the gene 
transduction efficiencies that the rAAV displays. 

There remains a need in the art for additional 
recombinant adenoviruses and rAAV, therapeutic 
compositions and methods which enable effective use of 
these recombinant viruses in the treatment of disorders 
and diseases by gene therapy. 

Summary of th e Invention 

In one aspect of this invention, a packaging cell 
l^- ne is provided which expresses adenovirus genes Ela, 

Elb and E4, or functional fragments thereof, e.g. , the E4 
open reading frame (ORF) 6. 

In another aspect, the invention provides a rAd 
comprising the DNA of at least a portion of the genome of 
an adenovirus having functional deletions of the El and 
E4 gene regions; a suitable gene operatively linked to 
regulatory sequences directing its expression, and an 
adenovirus capsid, the rAd capable of infecting a 
mammalian cell and expressing the gene product in the 
cell in vivo or in vitro. The invention also provides a 
mammalian cell infected with the rAd described above. 
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In still another aspect, the invention provides a 
rAd shuttle vector comprising the DNA of at least a 
portion of the genome of an adenovirus having functional 
deletions of the El and E4 gene regions. 

5 In a further aspect, the invention provides a method 

for producing the above-described recombinant Ad and a 

methodfordelivering aselected gene into— a— mammalian 

cell using the recombinant Ad described above. 

In another aspect, the invention provides a method 
10 for enhancing the efficiency of transduction of a 

recombinant AAV into a target cell. The method operates, 
in brief, by infecting a target cell with a ss 
recombinant adeno-associated virus (rAAV) which comprises 
a transgene operatively linked to regulatory sequences 
15 directing its expression, and contacting the infected 

cells with an agent which facilitates the conversion of 
ss rAAV to its double stranded (ds) form. Conversion of 
ss rAAV to ds rAAV occurs in the target cell, resulting 
in enhanced transduction of the rAAV into the target 
20 cell. The agent may be a helper virus which carries a 
selected gene or functional fragment thereof encoding a 
polypeptide capable of enhancing the conversion of the ss 
rAAV to ds rAAV and which is co-infected into the same 
target cell. The agent may also be a drug or chemical 
25 composition which accomplishes the same function and is 
applied to the infected target cell. This method can 
operate both in an ex vivo setting and in vivo. 

In yet another aspect, the invention provides a 
novel recombinant AAV, which contains both the transgene 
30 intended for use in treating a genetic disease or 

disorder and at least one additional gene operatively 
linked to inducible or constitutive regulatory sequences. 
The additional gene(s) encodes a polypeptide capable of 
facilitating, alone or in concert with other additional 
35 genes, the conversion of ss rAAV to its ds form upon 
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expression. In a preferred embodiment, the additional 
gene is adenovirus £4 or a functional fragment thereof. 
Also disclosed is a method for enhancing the efficiency 
of transduction of the novel rAAV into a target cell. 

5 The novel rAAV and methods of this invention are 

also useful in pharmaceutical compositions for use in ex 

vivo and iji vivo gene-therapy treatment protocols for— 

treating inherited diseases, cancer, and other genetic 
dysfunctions . 

10 Other aspects and advantages of the present 

invention are described further in the following detailed 
description of the preferred embodiments thereof. 

Prjef Description of the Prayings 

Fig. 1 is a schematic drawing o£ an exemplary 
15 plasmid pMMTVE40RF6 [SEQ ID NO: 1] or pMTE40RF6 , which 
contains an MMTV or sheep MT promoter, respectively, in 
control of a human E4 ORF 6 gene sequence, a growth 
hormone gene terminator sequence (GH) , an SV 40 ori, 
pBR3 22— based plasmid sequences including a neo* gene, an 
20 SV40 polyA site and an amp 2 gene. 

Fig. 2 is a schematic map of rAd H5.001CB LacZ [SEQ 
ID NO: 3] with indicated restriction endonuclease enzyme 
sites. The striated bar represents the CBLacZ minigene; 
the black bar represents Ad5 viral backbone, the 
25 crossha tched bar represents Ad E4 deletion. 

Fig. 3 plots LacZ forming units (LFU) /ml vs time 
(hours) for E4 complementing cell lines infected with 
H5 . OOlCBiacZ. 

Fig. 4A is a graph of the induction, ORF 6 expression 
30 and viral production in 293-27-18 packaging cells 

plotting yield at 24 hours post-infection (pi) in LFU/ml 
and ORF6 protein (abs.mm) vs. concentration of the 
inducer, dexamethasone (n M) . Abs.mm is the intensity of 
the size of the protein band on a Western blot and 
35 reflects absorbance and protein size in mm 2 . The square 
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is yield at 24 hours pi. The diamond is ORF6 protein 
detected at 24 hours pi. 

Fig. 4B is a similar graph to that of Fig. 4A, 
except that the packaging cells are 293-10-3 cells. The 
5 symbols are as described for Fig. 4A. 

Fig. 5A is a bar graph plotting S-galactosidase 

enzyme— activity— in— lysates— from— inf ected-Hela— cells. — The 

horizontal axis indicates the adenoviruses infected into 
the HeLa cells, with the symbol M + M indicating the 
10 addition of the adenovirus to the rAAV, AV.CM VLacZ. The 
vertical axis indicates intracellular fi-galactosidase 
specific activity (mUnits/mg protein) using ONPG. Below 
each bar, the fold- induction in specific activity 
relative to cells that received the AV.CM VLacZ vector 
15 alone is given. 

Fig. 5B is a bar graph plotting Ad multiplicity of 
infection (MOI) in HeLa cells of wild-type Ad5 or the E2 
mutant dl802, the cells co- infected with rAAV vs. 
intracellular B-galactosidase specific activity. See 
20 Example ll. 

Fig. 6A is a graph in which B-galactosidase specific 
activity and counts per minute (CPM) are plotted along 
the vertical axis and adenovirus MOl's are on the 
horizontal axis for HeLa cells infected with wtAd5 and 
25 rAAV according to Example 12. Data obtained from low MOI 
(1, 5, and 10) infections are shown. 

Fig. 6B is a graph similar to that of Fig. 6A except 
that the cells were infected with Ad mutant dl802. 

Fig. 7A illustrates a model for leading strand 
30 synthesis of a complementary AAV strand in the presence 
of Rep (+Rep) or absence of Rep (-Rep) . Rep expresses a 
terminal resolution activity that can convert a duplex 
structure with closed-ends to an open-ended duplex. In 
the absence of Rep, terminal resolution is impaired 
35 leaving the covalently closed, hairpin structures intact. 
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Under 'these conditions, hairpins are expected to be found 
leftward and rightward, since both strands of a rescued 
ds AAV genome are packaged into virions. 

Fig. 7B is a schematic of linear AV.CMVLacZ with 
5 labeled domains including the AAV XTRs, CMV immediate 
early enhancer/ promoter (CMV) , SV40 splice donor-splice 

acceptor (SD/SA), E. coiiB— galactosidasecDNA(iacZ), 

and SV40 polyA signal (pA) . Two Notl sites located at bp 
positions 1035 and 4509 are indicated. 

10 Fig. 7C illustrates a closed end and an open end 

fragment of r AV . CMVAa cZ . 

Figs. 7D, 7E and 7F indicate the mixture of 
open-ended and covalently closed duplex fragments 
generated by Notl digestion of ss AV. CMV Lac Z at position 
15 4509 in the absence of terminal resolution. The Notl 

4509 digestion provides a convenient means of releasing a 
361 bp fragment that contains the right ITR in the 
context of a hybridization target (i.e. SV40 pA) . In the 
presence of terminal resolution, only the open-ended 361 
20 bp fragment would be expected to be generated (Fig. 7D) 
by such digestion. 

Fig. 8A is a bar graph plotting fi-galactosidase 
specific activity (mUnits/mg protein) vs. increasing 
concentration of zinc (tM) inducer for cell line 293 (MT- 
25 ORF6) transduced with AVCM VLacZ (first row below each 
bar) . Also provided is the fold-induction relative to 
293 cells (second row below each bar) , and the 
fold-induction relative to 293 (ORF6) cells maintained in 
the absence of zinc (third row) . 

30 Fig. 8B is a bar graph plotting CPM of duplex 

monomer replicative form (RFm) of rAAV vs. the 
concentration of zinc (fM) used for induction and the 
fold- induct ion relative to 293 (ORF6) cells maintained in 
0 mM zinc below each bar. 
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Fig. 8C is a graphical comparison of the induction 
profiles that describe AV.CMV£ac2 transduction 
efficiency. Specific activity data from Fig. 8A and CPM 
data of AV . CKVLacZ RFm from Fig. 8B are plotted along the 
5 vertical axis, and concentration of zinc sulfate used 

during the experiment is shown along the horizontal axis. 

: Fig. 9 is a bar graph plotting specific activity 

(milliunits fi-galactosidase/mg protein) vs the 
concentration of zinc used for induction (first row under 
10 the horizontal axis) , the fold-induction relative to HeLa 
cells (second row) , and the fold-induction relative to 
HeLa (Mt— ORF6 ) cells maintained in the absence of zinc 
(third row), for the HeLa (MT-ORF6 ) cells transduced at an 
HOI of 1,000 AV.CM VLacZ virus particles/cell in the 
15 absence of zinc sulfate inducer or in the presence of 50, 
100, 150, 200 or 250 (M zinc sulfate inducer. 

Fig. 10 is a schematic of the plasmid pAV.CK VLa.cZ 
[SEQ ID NO: 4]. 

Fig. ll illustrates plasmid pAV . CMVALP . GRE-ORF 6 [SEQ 
20 ID NO: 5] . 

Detailed Descriptio n Q Jf_t h e I nvention 

The present invention provides packaging cell lines, 
which enable the production of recombinant adenoviruses 
(rAd) functionally deleted in both the El and E4 genes. 

25 These rAd and methods which enable the therapeutic 

treatment of disorders with such rAds are disclosed. 

Novel "second generation" recombinant adeno— associated 
virus (rAAV) and methods for enhancing the transduction 
efficiency of rAAV containing a transgene for expression 
30 in a somatic gene therapy protocol are also provided. 

The methods and compos itions of this invention are useful 
in ex vivo applications of gene therapy, such as in the 
transduction of bone marrow cells with desirable 
hematopoietic stem cell progenitor genes prior to bone 
marrow transplantation. The embodiments of the invention 
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are also useful in pharmaceutical compositions for direct 
in vivo treatment of patients by gene therapy vectors, 
including the transduction of desirable genes in patients 
with genetic disorders, such as cystic fibrosis* 

5 J. Packaging Call Lines 

To increase the transgene capacity and decrease 
immune response of^Ads , as many~viral“genes~as possible 
should be deleted to inactivate the adenovirus. However, 
it is crucial to generate complementing cell lines for 
10 construction and propagation of such deleted Ad* The 

method and compositions of the present invention overcome 
several problems previously identified in the gene 
therapy for first generation El deleted adenoviruses and 
display advantages in administration particularly to 
15 muscle tissue. 

Early region 4 (E4) of Ad serotype 5 consists of 7 
ORFs believed to be involved in viral DNA replication, 
host cell shut-off, and late mRNA accumulation. To 
generate rAd deleted in E4, the function of the E4 region 
20 must be supplied to the rAd by a helper virus or 

packaging cell line. However, useful packaging cell 
lines have not been available previously because normally 
the continuous expression of functioning Ad El and 
functional E4 in a single cell line are toxic to the 
25 cell. Such cells are therefore not useful for the growth 
and replication of rAds. Further, the DNA encoding the 
functional Ad El and Ad E4 genes, when present in a 
packaging cell line, can increase the chances of 
recombination with a rAd virus to cause the virus to 
3 0 revert to a wildtype Ad virus. 

The present invention avoids these problems by 
providing a packaging cell line which contains the Ad5 El 
gene and only the ORF 6 of the Ad5 E4 gene. ORF6 of E4 
alone can provide the requirements for E4 in the viral 
life cycle. 
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According to this invention, the ORF6 is preferably 
under the transcriptional control of an inducible 
promoter. The mouse mammary tumor virus (MMTV) promoter, 
inducible by a glucocorticoid, particularly, 

5 dexamethasone, is presently preferred. The DNA sequence 
of the MMTV promoter spans nucleotides 1-1506 of SEQ ID 
NOT - 17 Another inducible promoter is the sheep ~ 
metal lothionine (MT) promoter, inducible by zinc [M. G. 
Peterson et al, Eur. J. Biochem. . AZAM17-424 (1988)]. 

10 However, the zinc sulfate inducer of the MT promoter can 
itself be toxic to the cells. Other inducible promoters, 
such as those identified in International patent 
application W095/13392, published May 18, 1995, and 
incorporated by reference herein may also be used in the 
15 production of packaging cell lines according to this 
invention. Constitutive promoters, such as the 
constitutive Ad5 E4 region promoter, LTR, may be employed 
in control of the expression of ORF6. 

The packaging cell line of the invention which 
20 utilizes an inducible promoter permits one to control the 
development of toxicity by regulating the expression of 
the E4 ORF6 gene. After the desired shuttle vector 
containing the Ad sequences is transfected into the cell 
line, expression of the E4 ORF6 can be induced by the 
25 appropriate inducer. The packaging cell is thus able to 
provide both Ad El and Ad E4 0RF6 gene products to the 
rAd for a sufficient period to allow productive infection 
and recovery of the rAd, before the cell becomes toxic. 

At present, the time period before the cell experiences 
30 toxicity is about 10 days. 

In its most preferred form, the packaging cell line 
is a human embryonic kidney (HEK) 293 El expressing cell 
line into which is introduced the E4 ORF 6 sequence under 
the control of the inducible promoter. It should be 
understood by one of skill in the art that another parent 
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cell line may be selected for the generation of a novel 
cell line expressing the Ela, Elb, and E4 ORF6 genes of a 
selected adenovirus serotype. Among such parent cell 
lines may be included Hel>a [CCL 2], A549 [CCL 185], KB 
5 [CCL 17], Detroit [e.g., Detroit 510, CCL 72] and WI-38 
[ATCC CCL 75] cells. These cell lines are all available 

the American Type - Culture” Coll ect i on, 12301 P ar kl awn - 
Drive, Rockville, MD, USA. Other suitable parent cell 
lines may be obtained from other sources. If such parent 
10 cell lines were selected for modification, the cell line 
would need to be further supplied with the Ela and Elb 
gene functions, e.g. , such as by transfection with a 
plasmid containing these genes or functional fragments 
thereof under a suitable promoter, as well as with the 
15 ORF6 gene as described herein. 

Example 1 teaches construction of packaging cell 
lines containing only the ORF 6 of Ad5 E4 region or, for 
functional comparisons, the entire E4 region. Briefly 
described, the entire E4 region and an ORF 6 sequence of 
20 Ad 5 E4 gene were obtained by known techniques [see, 

e.g., Sambrook et al., "Molecular Cloning. A Laboratory 
Manual.", 2d edit.. Cold Spring Harbor Laboratory, New 
York (1989) and references cited therein]. To isolate 
the ORF 6 region, the anchored PCR technique was used to 
25 amplify the ORF6 sequence from its initiation codon to 
its termination codon. Primers selected from the 
published sequence of ORF6 were used to amplify the ORF 
sequence and insert restriction sites onto the end of the 
sequence. The E4 ORF6 sequence itself is reproduced as 
30 nucleotides 1523 through 2408 of SEQ ID HO: 1. The 
entire E4 gene sequence is published in the Genbank 
sequence of Ad5 [Genbank Accession No. M73260] • 

A minigene was constructed that placed the ORF6 
sequence under the control of a selected promoter • By 
35 "minigene" as used here is meant the combination of the 




WO 96/39530 



PCT/US96/1 0245 



12 

desired sequence to be expressed (in this particular 
instance, the 0RF6 sequence) and the other regulatory 
elements necessary to transcribe the desired sequence and 
express the gene product in a cell containing that 
5 minigene. The ORF6 sequence gene is operatively linked 
to regulatory components in a manner which permits its 

-transcription. Such-components— include conventional 

regulatory elements, such as a promoter to drive ORF6 
expression. One inducible promoter was the Zn +2 
10 inducible MT promoter; the other was the dexamethasone- 
inducible MMTV promoter of SEQ ID HO: 1. 

The minigene also contains nucleic acid sequences • 
heterologous to the ORF6 viral sequence, including 
sequences providing signals required for efficient 
polyadenylation of the transcript (poly-A or pA) . A 
common poly-A sequence which is employed in this 
invention is that derived from the growth hormone (GH) 
gene terminator sequence (nuc. 2409-3654 of SEQ ID NO: 

1) . The poly-A sequence generally is inserted in the 
minigene following the ORF6 sequence. The polyA sequence 
employed in the MMTV-ORF6 minigene described in Example 1 
[SEQ ID NO: 1] is supplied by the GH gene terminator and 
an SV40 origin of replication (ori) . A similar minigene 
differing in promoter sequence, polyA sequence and/or 
SV40 ori can also be designed by one of skill in the art 
to transfer the E4 ORF6 sequence to a shuttle plasmid. 
Selection of these and other common vector elements are 
conventional [see, e.g., Sambrook et al, cited above, and 
references cited therein] and many such sequences are 
available from commercial and industrial sources as well 
as from Genbank. 

The ORF6-containing minigene was subcloned into a 
pBR322-based shuttle plasmid that contained a neomycin 
resistance gene, resulting in the shuttle vector of Fig. 

1. Any of the many known bacterial shuttle vectors may 
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be employed to carry the minigene, providing that the 
vector contains a reporter gene or selectable marker of 
which many, e.g. , neo, amp or purimycin, are known in the 
art. It is expected that one of skill in the art can 
5 develop other suitable shuttle vectors using other 
plasmid components which are similarly capable of 

transferring the ORF6 minigene into the chromosome ofa 

cell transfected with the plasmid. 

As further described in Example 1, other shuttle 
10 vectors were designed for comparative purposes, which 
contain the complete or substantially complete Ad5 E4 
region under the control of the constitutive retroviral 
MLV LTR sequence in the presence or absence of the 
endogenous E4 promoter. The shuttle plasmid carrying the 
15 ORF6 minigene (or the entire E4 region) was introduced 

into HEK 293 cells which express the Ad El gene products, 
complementing cell lines were generated that express 
these Ad E4 or ORF6 genes from either their endogenous 
promoters or heterologous inducible promoters. These 
20 cell lines are further characterized by their genetic 
constitution, E4 protein synthesis, recombinant AAV 
helper function, relative plaque efficiency of H5dll004 
virus, and growth kinetics of recombinant E1/E4 deleted 
adenovirus. These characteristics of exemplary E1/E4 
25 expressing packaging cell lines are discussed in detail 
in the following examples. 

II. Recombinant Adenovirus 

The E1/E4 expressing cell line is useful in 
constructing E1/E4 deleted rAds which can deliver a 
30 suitable gene to mammalian cells and tissues. These rAd 
are functionally deleted in at least the Ela, Elb and E4 
Ad gene regions. By the term "functionally deleted" is 
meant that a sufficient amount of the gene region is 
removed or otherwise damaged, e.g., by mutation or 
35 modification, so that the gene region is no longer 
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capable of producing the products of gene expression. If 
desired, the entire gene region may be removed. In in 
vivo experiments with the rAd grown in the packaging cell 
lines, the E1/E4 deleted rAd demonstrated utility 
5 particularly in transferring a transgene to a muscle 
cell. 

The adenovirus sequences used in the construction of 
the shuttle vectors, helper viruses, if needed, and rAd, 
and other components and sequences employed in the 
10 construction of the vectors and viruses described herein 
may be readily obtained from commercial or academic 
sources based on previously published and described 
sequences. Viral materials may also be obtained from an 
individual patient. The viral sequences and vector 
15 components may be generated by resort to the teachings 
and references contained herein, coupled with standard 
recombinant molecular cloning techniques known and 
practiced by those skilled in the art. Modifications of 
existing nucleic acid .sequences forming the vectors, 

20 including sequence deletions, insertions, and other 

mutations taught by this specification may be generated 
using standard techniques. Similarly, the methods 
employed for the selection of viral sequences useful in a 
vector, the cloning and construction of the "minigene" 

25 and its insertion into a desired viral shuttle vector and 
the production of a recombinant infectious virus are 
within the skill in the art given the teachings provided 
herein. 

A. Construction of the Transgene 
30 A "minigene" in this context is defined as 

above, except that the components of this minigene are 
designed to express the gene product ex vivo or in vivo. 
Such components include conventional regulatory elements 
necessary to drive expression of the transgene in a cell 
transfected with the rAd. For this minigene, a selected 
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promoter is operatively linked to the transgene and 
located, with other regulatory elements, within the 
selected viral sequences of the recombinant vector. 
Selection of the promoter is a routine matter and is not 
5 a limitation of this invention. Useful promoters may be 
constitutive promoters or regulated (inducible) 
promoters , which will - enable control - ^ of the amount o f ~ the 
transgene to be expressed. For example, a desirable 
promoter is that of the cytomegalovirus (CMV) imm ediate 
10 early promoter/ enhancer [see, e.g., Boshart et al. Cell, 
41:521-530 (1985)]. Another desirable promoter includes 
the Rous sarcoma virus LTR promoter /enhancer. Still 
another promoter /enhancer sequence is the chicken 
cytoplasmic B— actin (CB) promoter [T. A. Kost et al, 

15 Nucl . Acids Res . . ±1(23) :8287 (1983)]. Other suitable 
promoters may be selected by one of skill in the art. 

The minigene may also desirably contain nucleic 
acid sequences heterologous to the viral vector sequences 
including poly-A sequences and introns with functional 
20 splice donor and acceptor sites, as described above. The 
poly-A sequence generally is inserted in the minigene 
following the transgene sequences and before the 3 * 
adenovirus sequences. A minigene of the present 
invention may also contain an intron, desirably located 
25 between the promoter/ enhancer sequence and the transgene. 
Selection of these and other common vector elements are 
conventional as described above and many such sequences 
are available from commercial and industrial sources as 
well as from Genbank. 

As above stated, the minigene is located in the 
site of any selected deletion in the rAd. In the E1/E4 
deleted rAd H5. QOlCBLacZ, the transgene is located in the 
deleted El gene region. However, the transgene may be 
located elsewhere in the adenovirus sequence, as desired. 
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B. Production of Recombinant Adenoviirus 

Adenovirus sequences useful in this invention 
may include the DNA sequences of a number of adenovirus 
• types, which are available from Genbank, including type 
5 Ad5 [Genbank Accession No. M73260]. The adenovirus 
sequences may be obtained from any known adenovirus 

serotype, such as-serotypes-2^ — 3 - 7 — 4 , — l- f — 12— and— 40,— and 

further including any of the presently identified 41 
human types [see, e.g., Horwitz, cited above]. Similarly 
10 adenoviruses known to infect other animals may also be 

employed in the vector constructs of this invention. The 
selection of the adenovirus type is not anticipated to 
limit the following invention. A variety of adenovirus 
strains are available from the American Type Culture 
15 Collection, Rockville, Maryland, or available by request 
from a variety of commercial and institutional sources. 

In the following exemplary embodiment an adenovirus, type 
5 (Ad5) is used for convenience. 

However, it is desirable to obtain a variety of 
20 adenovirus shuttle vectors based on different human 

adenovirus serotypes. It is anticipated that a library 
of such plasmids and the resulting rAds would be useful 
in a therapeutic regimen to evade cellular, and possibly 
humoral, immunity, and lengthen the duration of transgene 
25 expression, as well as improve the success of repeat 

therapeutic treatments. Additionally the use of various 
serotypes is believed to produce rAd with different 
tissue targeting specificities. Additionally, the 
absence of adenoviral genes El and E4 in the rAd of this 
30 invention should reduce or eliminate adverse CTL 

responses which normally cause destruction of rAds 
deleted of only the El gene. 

rAds of this invention are recombinant, 
defective adenoviruses (i.e., El deleted) which are also 
deleted completely or functionally of the E4 gene region. 
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Functional deletions of E4 gene regions may be assessed 
by assays of Examples 2 and 3, among other assays. rAds 
useful in this invention may optionally bear other 
mutations, e.g., temperature sensitive (ts) mutations in 
5 the E2a gene region, and deletions in the E3 gene 
regions • 

An adenovirus of this invention contains a 

functional deletion of the adenoviral early immediate 
early gene Ela (vhich spans mu 1.3 to 4.5) and delayed 
10 early gene Elb (vhich spans mu 4.6 to 11.2). Similarly 
the adenovirus has a functional deletion of the whole E4 
region (vhich spans mu 92 to 97.2), or of at least ORF6 
of the E4 region. Gene regions vhich may be optionally 
deleted in the E1/E4 deleted rAd of this invention 
15 include all or a portion of the adenovirus delayed early 
gene E3 (vhich spans mu 76.6 to 86.2). The function of 
E3 is irrelevant to the function and production of the 
rAd. 

The rAd of this invention may also have a 
20 mutation vhich results in reduced expression of 

adenoviral protein and/or reduced viral replication. For 
example, a ts mutation may be introduced into the 
adenovirus delayed early gene E2a (vhich spans mu 67.9 to 
61.5). Among such mutations include the incorporation of 
25 the missense ts mutation in the (DBP)E2a region found in 
the Ad5 H5tsl25 strain [P. Vander Vliet et al, J, Virol t* 
1^2348-354 (1975)] at 62.5 mu. A single amino acid 
substitution (62.5 mu) at the carboxy end of the 72 kd 
protein produced from the E2a gene in this strain 
30 produces a protein product vhich is a ss DNA binding 

protein and is involved in the replication of adenoviral 
genomic DNA. At permissive temperatures (approximately 
32 °C) the ts strain is capable of full life cycle growth 
on HeLa cells, while at non-permissive temperatures 
35 (approximately 38°C) no replication of adenoviral DNA is 
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seen* In addition, at non-permissive temperatures, 
decreased immunoreactive 72 kd protein is seen in HeL*a 
cells. See, e.g., J. F. Engelhardt et al. Hum * Gene 
Ther. . 5:1217-1229 (1994) ; *T. F. Engelhardt et al, Proc* 

5 Natl^Acad* Sci*. USA * £ 1 : 6196-6200 (1994) and 

International patent application W095/13392, published 

May— 1 - 87 — 1995— incorporated- by- refer ence-herein. 

However , it should be understood that other 
deletions in the adenovirus genome as previously 
10 described in the art or otherwise may also occur in the 
rAds of this invention. One minimal type of rAd can 
contain adenovirus genomic sequences from which all viral 
genes are deleted. More specifically, the adenovirus 
sequences may be only the cis-acting 5 1 and 3 • inverted 
15 terminal repeat (ITR) sequences of an adenovirus (which 
function as oris) and the native 5 1 packaging/enhancer 
domain, that contains sequences necessary for packaging 
linear Ad genomes and enhancer elements for the El 
promoter. The adenovirus 5 1 sequence containing the 5' 

20 ITR and packaging/ enhancer region (Ad5 mu 0-1 or bp 1- 

360) can be employed as the 5 1 adenovirus sequence in rAd 
of this invention* The 3* adenovirus sequences including 
the right terminal (3 9 ) ITR sequence of the adenoviral 
genome spanning about bp 35,353 - end of the adenovirus 
25 genome, or map units "98.4-100 may be desirably employed 
as the 3 9 sequence of the rAd. These sequences, which 
are clearly devoid of the El and E4 genes, can flank, or 
be operatively associated with the minigene in a rAd* 

Any other necessary Ad gene products will then be 
30 supplied by helper viruses and the E1/E4 ORF6 expressing 
packaging cell of this invention. 

Exemplary rAd for use in this invention, for 
example, may be obtained by homologous recombination of 
desired fragments from various rAd, a technique which has 
been commonly employed to generate other rAd for gene 
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'therapy use. In 'the examples below, a representative 
rAd, H5. OOlC&LacZ, is constructed by homologous 
recombination between the adenovirus dll004 (also 
H5dll004) viral backbone and pAdCRLacZ minigene DNA. 

5 H5dll004 is an Ad5 virus deleted of from about map unit 
92.1 through map unit 98, i.e, substantially the entire 

E4gene. — The _ dT1004 — virus— is— described— ^ in— Br-idge— and 

Ketner, J. Virol. . 121(2) : 631-638 (Feb. 1989). 

The pAdCB LacZ vector is a cDNA plasmid 
10 containing Ad m.u. 0-1, an El deletion into which is 
inserted a bacterial fi-galactosidase gene under the 
control of a chicken 6-actin promoter, with other 
regulatory elements as described below, and flanked by Ad 
m.u. 9-16 and plasmid sequence. 

15 The production of the E1/E4 rAd of this 

invention in the packaging cell line of this invention 
utilizes conventional techniques. Such techniques 
include conventional cloning techniques of cDNA such as 
those described in texts [Sambrook et al, cited above], 

20 use of overlapping oligonucleotide sequences of the 

adenovirus genomes, PCR and any suitable method which 
provides the desired nucleotide sequence. Standard 
transfection and co-transfection techniques are employed, 
e.g., CaP0 4 transfection techniques using the 
25 complementation 293 cell line. Other conventional 

methods employed include homologous recombination of the 
viral genomes, plaquing of viruses in agar overlay, 
methods of measuring signal generation, and the like. 

For example, following the construction and 
30 assembly of the desired minigene— containing plasmid 

vector pAdCB LacZ, the E1/E4 expressing packaging cell 
line of this invention is infected with the helper virus 
H5dll004. The infected cell line is then subsequently 
transfected with an adenovirus plasmid vector by 
35 conventional methods. Homologous recombination occurs 
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between the £4 -deleted H5dll004 helper and the pAdC BLacZ 
vector, which permits the adenovirus-transgene sequences 
in the vector to be replicated and packaged into virion 
capsids, resulting in the rAd. About 30 or more hours 
5 post-transfection, the cells are harvested, an extract 
prepared and the rAd containing the LacZ transgene is 

purifiedbybuoyantdensityultracentrifugation in aCsCl 

gradient. 

III. Use of the Recombinant Virus in Gene Therapy 
10 The rAd containing the transgene produced by . 

cooperation of the adenovirus vector and £4 deleted 
helper virus and packaging cell line, as described above, 
provides an efficient gene transfer vehicle which can 
deliver the transgene in a pharmaceutical composition to 
15 a patient in vivo or ex vivo and provide for integration 
of the gene into a mammalian cell. 

The rAd are administered to humans in a conventional 
manner for gene therapy and serve as an alternative or 
supplemental gene therapy for the disorder to which the 
20 transgene is directed. A rAd of this invention may be 
administered to a patient, preferably suspended in a 
biologically compatible solution or pharmaceutically 
acceptable delivery vehicle. A suitable vehicle includes 
sterile saline. Other aqueous and non-aqueous isotonic 
25 sterile injection solutions and aqueous and non-aqueous 
sterile suspensions known to be pharmaceutically 
acceptable carriers and well known to those of skill in 
the art may be employed for this purpose. 

The rAd are administered in sufficient amounts to 
30 transfect the desired target cells, e.g., muscle, liver, 
epithelial, etc. and provide sufficient levels of 
transfer and expression of the transgene to provide a 
therapeutic benefit without undue adverse or with 
medically acceptable physiological effects which can be 
determined by those skilled in the medical arts. 
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Conventional and pharmaceutically acceptable routes of 
administration include direct delivery to the muscle or 
other selected cell, intranasal, intravenous, 
intramuscular, subcutaneous, intradermal, oral and other 
5 parental routes of administration. Routes of 
administration may be combined, if desired. 

Dosages of-rAd will— dependprimarily-onf actors such 

as the condition being treated, the age, weight and 
health of the patient, and may thus vary among patients. 
10 For example, a therapeutically effective human dose of 
the rAd is generally in the range of from about 20 to 
about 100 ml of saline solution containing concentrations 
of from about lxlO 9 to lxio 11 pfu/ml virus. A preferred 
human dose is estimated to be about 50 ml saline solution 
15 at 2xl0 10 pfu/ml. The dose will be adjusted to balance 
the therapeutic benefit against any side effects. The 
levels of expression of the transgene can be monitored to 
determine the frequency of administration. 

An optional method step involves the co- 
20 administration to the patient, either concurrently with, 
or before or after administration of the rAd of a 
suitable amount of a short acting immune modulator. The 
selected immune modulator is defined herein as an agent 
capable of inhibiting the formation of neutralizing 
25 antibodies directed against the recombinant vector of 

this invention or capable of inhibiting or substantially 
delaying cytolytic T lymphocyte (CTL) elimination of the 
vector. Among desirable immune modulators are 
interleukin-12 [European Patent Application No. 441,900]; 
30 gamma interferon [S. C. Morris et al, J. Immunol.-# 

152 :1047 (1994)]; interleukin-4 [United States Patent No. 
5,017,691]; antibody to the CD4 protein, such as anti-OKT 
3+ [see, e.g., US Patent No. 4,658,019] or antibody GK1.5 
(ATCC Accession No. TIB207) ; a soluble CD40 molecule or 
35 an antibody to CD40 ligand (Bristol-Myers Squibb Co) 
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[European patent application 555,880, published August 
18, 1993]; a soluble form of B7 or an antibody to CD28 or 
CTLA4 [CTLA4— Ig (Bristol-Myers Squibb Co) , European 
patent application 606,217, published July 20, 1994], or 
5 agents such as cyclosporin A or cyclophosphamide. Thus, 
the pharmaceutical compositions and methods of this 

invention— provide a desirable gene therapy—' treatment. 

IV. Recombinant Adeno— Associated virus 
In the following context the term "transgene" means 
10 a nucleic acid sequence or reverse transcript thereof, 
heterologous to the AAV sequence, which encodes a 
polypeptide or protein of interest. The transgene may be 
operatively linked to regulatory components in a manner 
which permits transgene transcription, i.e., the 
15 transgene is placed into operative association with a 

promoter, as well as other regulatory sequences, such as 
SV40 introns or polyA sequences, useful for its 
regulation. The composite association of the transgene 
with its regulatory sequences is referred to herein as a 
20 minicassette or minigene. 

The composition of the transgene or minicassette 
sequence will depend upon the use to which the resulting 
rAAV will be put. For example, one type of transgene 
sequence includes a reporter sequence, which upon 
25 expression produces a detectable signal. Such reporter 
sequences include without limitation, an E. coli B- 
galactosidase (LacZ) cDNA, an alkaline phosphatase gene 
(ALP) and a green fluorescent protein gene. These 
sequences, when associated with regulatory elements which 
30 drive their expression, provide signals detectable by 
conventional means, e.g., ultraviolet wavelength 
absorbance, visible color change, etc. 

Another type of transgene sequence includes a 
therapeutic gene which expresses a desired gene product 
35 in a host cell. These therapeutic nucleic acid sequences 
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typically encode products for administration and 
expression in a patient In vivo or ex vivo to replace or 
correct an inherited or non-inherited genetic defect or 
treat an epigenetic disorder or disease. Such transgenes 
5 may be readily selected by one of skill in this art and 
the design of the transgene or the minicassette for 

insertionintothe rAAV is- not- a- limitation of this 

invention. 

The term "rAAV" encompasses any recombinant AAV gene 
10 therapy vehicle of the prior art, including the AdAAV 

hybrid virus described in published International Patent 
Application Ho. W096/13598, published May 9, 1996. More 
specifically, rAAV defines a rAAV comprising: (a) the DNA 
of at least a portion of the genome of an AAV, which 
15 portion is capable of transducing into a target cell at 
least one selected gene in the absence of cell division; 
and (b) at least one selected gene (or transgene) 
operatively linked to regulatory sequences directing its 
expression, the gene flanked by the DNA of (a) and 
20 capable of expression in the target cell in vivo or In 
vitro. 

Other rAAVs have been described in the art. The 
method of this invention is not limited by the precise 
nature of the AAV sequences used in the rAAV, provided 
25 that at a minimum both the 5 ' and 3 • AAV inverted 

terminal repeats are present. Thus, the rAAV may be 
selected by one of skill in the art, and is not itself a 
limitation on this invention. The rAAVs specifically 
disclosed herein are illustrative. 

30 By the term "transduction" is meant that the rAAV 

produced by practice of the invention is capable of 
infecting a desired target cell and expressing the 
transgene in the cell by harnessing the cell's machinery. 
Transduction may include stably integrating the viral DNA 
35 into a chromosome of the target cell. "Enhanced 
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‘transduction" is defined as the ability of the rAAV in 
the presence of a conversion agent to transduce the 
target cell, either in vitro, ex vivo or in vivo, at an 
efficiency greater than a typical prior art rAAV produced 
5 in, and purified from, a culture co-infected with an 
adenovirus or herpesvirus helper. 

Thismethod is based on the observation thatthe 

limiting step in rAAV mediated transduction of cells for 
gene therapy is not the internalization or transfer of 
10 the ss viral genome, but rather the subsequent conversion 
of the single-stranded (ss) viral genome to a 
transcriptionally active double-stranded (ds) form. 
Formation of ds DNA intermediates is necessary for 
recombinant gene expression, which is likely to be 
15 modulated by viral and cellular factors through 

posttranscriptional mechanisms. The inventors have 
designed a method to overcome this rate-limiting step, 
thereby enhancing transduction ability of an rAAV and 
ultimately the use of rAAV in gene therapy protocols. 

20 This method of the present invention may employ a 

conventionally prepared ss rAAV containing a transgene. 
The prior art produces ss rAAV by co-infection in culture 
with a helper adenovirus or herpesvirus, followed by 
purifying the rAAV from the culture contaminants 
25 including the helper virus, and infecting the target cell 
with the rAAV alone. The present invention provides for 
infecting a target cell with a ss rAAV. However, once 
the target cell is infected, the infected cell is 
contacted with an agent which facilitates the conversion 
30 of the ss rAAV to the ds form of rAAV. The action of 
this "facilitating agent" or "conversion agent" causes 
the ss to ds conversion to occur in the target cell, 
resulting in enhanced transduction of the recombinant AAV 
into the target cell. By facilitating the conversion of 
ss to ds rAAV in the target cell, the method of this 
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invention may also result in both transduction and stable 
chromosomal integration of the rAAV into the chromosome 
of said host cell. 

Preferably, for use of this invention the 
5 "facilitating or conversion agent" may take several 
forms. 

A. The Conversion Agen't -is a Helper Vims 

In one embodiment, the agent is a helper virus 
and the method includes an additional step of co- 
10 infecting the target cell with the helper virus. The 
helper virus useful in this method contains a selected 
gene which can facilitate the conversion of ss rAAV to ds 
rAAV. The selected gene may encode a gene product or 
polypeptide (or a functional fragment of the polypeptide 
15 which shares the biological activity of the full-length 
polypeptide) which enhances the conversion. 

Alternatively, the selected gene may express an antisense 
or ribozyme which functions in the cell to block or 
inhibit a cellular gene that normally prevents ss to ds 
20 conversion of the rAAV. These genes may also be employed 
in the second generation rAAV described below. 

The helper virus is capable of expressing the 
selected gene product in the target cell in the absence 
of cell division. The helper virus may be a wild-type or 
25 mutant adenovirus. The helper virus may alternatively be 
a wild-type or mutant herpesvirus. Preferably, for use 
as facilitating agents, such viruses are mutants deleted 
of several normal genes so that the helper viruses and/or 
their expressed gene products will not cause disease in a 
30 patient. 

For example, a helper adenovirus useful in this 
invention may express only a gene product of a single 
adenoviral early gene. Exposure of the ss rAAV to an Ad 
early gene product is sufficient to substantially enhance 
the formation of ds rAAV genome with a coordinate 
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increase in transduction efficiency. The Ad early genes 
which are useful in producing this effect are El, E2a, E4 
and functional fragments thereof* However, as 
demonstrated by the examples below , adenovirus 
5 s ub stantially enhances recombinant AAV transduction in 
vitro in a way that is dependent on expression of the El 

andE4genesof adenovirusand is directly— proportional — 

to the appearance of ds replicative forms of rAAV • 

One example of a helper virus is an adenovirus 
10 deleted of most of its wild-type early genes and which is 
capable of expressing only its E4 gene or a functional 
fragment thereof in the target cell. Among such 
functional fragments is the ORF 6 of the E4 gene* As 
described below in the examples, experiments in cell 
15 lines indicate that the ORF6 of the adenoviral E4 gene 
locus is sufficient to significantly enhance rAAV 
transduction. Selective expression of the E4-ORF6 
product of adenovirus accomplishes a increase in 
transduction efficiency similar to, but somewhat 
20 attenuated, compared to that produced by exposure to the 
El and E4 gene products in combination. That is, the 
ORF6 product of E4 is sufficient to enhance the 
augmentation of rAAV transduction; but this effect is 
amplified substantially by El gene products* 

25 Thus, more preferably, exposure of the rAAV to 

both the expressed El and E4 gene products produces a 
substantial enhancement of the above— described rate 
limiting step. Therefore, another exemplary helper virus 
may also contain more than one gene which, upon 
30 expression, facilitates the ss to ds conversion. An 
example of such a helper virus is an adenovirus which 
expresses both the El and E4 genes, or functional 
fragments thereof* Still other Ad genes may be expressed 
by the helper virus, provided that the virus is 
sufficiently crippled so that it does not cause disease 
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in the patient contributing the target cells. 

Where the agent which facilitates conversion of 
ss to ds rAAV is a helper virus, the method of the 
invention comprises co— infecting the target cell with the 
5 rAAV and the helper virus. Such co-infection may occur 
in the context of ex vivo therapy, i.e., manipulations 

performed on cells extractedf romthe patient , which 

cells are reinserted into the patient after the method is 
performed. Alternatively, the patient may be directly 
10 co-infected with the two viruses by conventional means. 
Delivery of the two viruses to the patient may be 
directed to a specific organ, or to the general 
circulatory system. Such delivery methods are described 
in the art for gene therapy of, e.g. , cystic fibrosis 
15 [see, e.g., us Patent No. 5,240,846]. 

B. The conversion Agent is a Chemical, Drug 
or Other Entity that can Activate rAAV Transduction 

In another embodiment of the method of this 
invention, the conversion agent which contacts the cells 
20 infected with the rAAV may be selected from the following 
classes of known compounds or methods: 1) inhibitors of 
DNA synthesis such as hydroxyurea, hydrogen peroxide, and 
other direct or indirect inhibitors of DNA polymerase; 2) 
chemo- therapeutic agents that induce DNA deunage, such as 
25 cyclophosphamide, alkylating agents, purine analogs, 

e.g., 6-thioguanine, etc.; 3) drugs that interfere with 
DNA modifying enzymes, such as inhibitors of 
topoisomerase , DNA ligase exonucleases and endonucleases; 
and 4) agents that nonspecif ically enhance transcription, 
30 such as sodium butyrate, or agents that stabilize cells, 
such a6 DHSO. Also, genotoxic agents such as carcinogens 
may be employed as the conversion agent. Other methods 
of inducing disruption or damage to DNA may also be 
useful as agents capable of facilitating ss to ds 
conversion of rAAV and maybe selected by one of skill in 
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the art, including physical methods, such as irradiation. 
These classes of compounds or methods are believed to 
result in the conversion from ss to ds rAAV. 

According to this embodiment of the method of 
5 the invention, the rAAV is again produced conventionally, 
but not co- infected with a helper virus. The ss rAAV is 

infectedintothetargetcell, and the infectedcellis — 

contacted by the agent in an appropriate manner depending 
on the identity of the agent. These conversion enhancing 
10 agents can be employed in ex vivo treatment of the target 
cells infected by the rAAV by application directly to the 
cells. Such application can occur substantially 
simultaneously, or consecutively, with application of the 
rAAV gene therapy vehicle. For example, the infected 
15 target cell may be subjected to one of the above— listed 
compounds or drugs for a desired time period. The 
parameters for contacting the infected cells with the 
agent may readily be determined by one of skill in the 
art. These parameters will depend upon whether the 
20 method is performed ex vivo or in vivo . For example, the 
number of ex vivo infected cells to be treated will be 
considered for the dosage, and timing of such treatment. 

Similarly, the physical status of the patient 
can determine the parameters of delivery of the agent to 
25 the patient in vivo. The dosage and amount of the 

damaging agent may therefore be adjusted by one of skill 
in the art. Where the agents are typical 
chemotherapeutic drugs approved for use in humans or 
animals, such enhanced conversion of rAAV may also occur 
30 in vivo by the co-administration of the agent, i.e., the 
chemotherapeutic drug, and the rAAV gene therapy vehicle 
to the patient. According to this aspect of the 
invention, the chemotherapeutic drug would be 
administered only when the rAAV is administered. 
Appropriate dosages and amounts of chemotherapeutic drugs 
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and recombinant: gene therapy vehicles and means for 
determining such amounts are within the skill of the art. 
However , because the effect of the chemotherapeutic drug 
will enhance the ss to ds conversion of the rAAV and thus 
5 enhance its efficiency of transduction into the target 
cells, it is anticipated that lower dosages than the 

conventional - dosagesof - eitherorboththedrugandthe 

rAAV could be effectively administered. 

C. Conversion Agent. May be Part of the rAAV • 
10 In still another embodiment of this invention, 

a novel "second generation" rAAV may be designed to 
incorporate the conversion agent into the virus, so that 
both the transgene and the conversion agent are co- 
expressed in the target cell. Such a novel recombinant 
15 adeno-associated virus comprises the following 
components: 

(a) the DNA of at least a portion of the 
genome of an adeno- associated virus which portion is 
capable of transducing at least two selected genes or 
20 functional fragments thereof into a target cell in the 
absence of cell division; (b) a first selected gene, 
i.e., the desired transgene, operatively linked to 
regulatory sequences directing its expression, and (c) a 
second selected gene, i.e., the "conversion gene" 

25 operatively linked to regulatory sequences capable of 
directing expression of said second gene. The 
"conversion gene" upon expression is capable of 
facilitating the conversion of the ss rAAV to its ds form 
upon expression. The first and second genes in this rAAV 
30 are flanked by the AAV DNA, preferably the S' and 3* 

ITRs. An embodiment of such a second generation rAAV is 
provided schematically in Fig. 11 • Zt,s DNA sequence is 
provided in SEQ ID NO: 5. 

Another embodiment of such a novel rAAV may 
35 include more than one gene which upon expression has the 
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ability to facilitate conversion of ss to ds rAAV in the 
target cell. For example, the novel rAAV described above 
may al6o contain an additional selected gene operatively 
linked to regulatory sequences capable of directing its 
5 expression, the additional gene and said second 

"conversion" gene described above being capable of 

jointly— facilitating— the-conversion-of—ss-rAAV—to-its-ds_ 

form upon expression of both the second and additional 
genes. In this rAAV, all three genes, i.e. , the 
10 transgene, the second "conversion" gene and the 
additional gene are flanked by the AAV DNA. 

In one desirable embodiment of a novel rAAVm 
the AAV ITRs flank a selected transgene, and a conversion 
gene, which is the adenovirus E4 gene or a functional 
15 fragment thereof (e.g., the ORF6 sequence). In another 
embodiment, the novel recombinant expresses three genes, 
the transgene, the adenovirus E4 gene or a functional 
fragment thereof and the adenovirus El gene or a 
functional fragment thereof. The E4 and El gene products 
20 expressed in the target cell with the transgene, together 
act to facilitate conversion of the ss to ds form of 
rAAV. 

In still another embodiment of the novel rAAV 
and its use, the regulatory sequences directing 
25 expression of the conversion gene, e.g., whether it be a 
single second gene or more than a single additional gene, 
may include an inducible promoter. Thus, expression of 
the conversion gene occurs only in the presence of an 
inducing agent. Many inducible promoters and companion 
30 inducing agents, e.g., steroids such as glucocorticoids, 
are known to the art and may be readily selected for 
incorporation into the rAAV and methods of this invention 
by one of skill in the art with resort to this 
description. 

The method of the invention employing such 
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"second generation" rAAVs which carry at least one 
"conversion gene" provides for infecting the target cell 
with this ss rAAV. Where the promoters directing 
expression of both the transgene and the conversion gene 
5 are constitutive, the infected target cell machinery will 
direct the expression of the transgene product and 

conversion - gene-products — Co-expression— in-the-target 

cell of the transgene and the "conversion gene" 
facilitates the conversion of ss rAAV to ds rAAV in the 
10 cell, and increases the transduction efficiency, and 

perhaps stable chromosomal integration, without further 
method steps. 

When the second generation rAAV employed in the 
method contains the "conversion gene(s)" under the 
15 control of inducible promoter (s) , the method is slightly 
altered. Following infection of the target cell by the 
rAAV, the infected target cell is contacted with a 
suitable inducing agent, which triggers the inducible 
promoter to "turn on" production of the conversion gene 
20 product. When the inducing agent is removed or stopped, 
the expression of the conversion gene product is "turned 
off". 

As described above, any prior art rAAV 
containing a transgene for gene therapy may be used in at 
25 least one embodiment of the above methods. The sources, 
selection and assembly of the various components to 
generate the rAAV, including the novel rAAV described 
above, are now conventional and readily accessible to one 
of skill in this art, given the disclosure contained 
30 herein. Such methods employ conventional genetic 

engineering techniques [See, e.g. Sambrook et al, cited 
above] . 

The novel rAAV viruses and the methods of this 
invention provide efficient gene transfer vehicles for 
35 somatic gene therapy and are suitable in pharmaceutical 
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compositions for ex vivo applications and in vivo use. 
When rAAV contain a therapeutic gene, e.g., in place of 
the LaeZ transgene illustrated in the exemplary rAAV , 

AV . CMVLacZ , by use of the rAAV and the methods described 
5 herein, the therapeutic transgene can be delivered to a 
patient in vivo or ex vivo to provide for efficient 

transduction, andpossiblystableintegration, ofthe 

desired gene into the target cell. Thus, these novel 
rAAV and the methods described herein can be employed to 
10 correct genetic deficiencies or defects. The potential 
of AAV to efficiently integrate its genome into 
nondividing cells is currently being exploited in the 
development of gene therapies based on ex vivo 
transduction of hematopoietic stem cells. In vivo 
15 application of rAAV is primarily being developed for the 
treatment of CF where purified stocks of virus are 
instilled into the airway to transduce the terminally 
differentiated epithelial cells of conducting airway. 

The methods and compositions described herein can be used 
20 with both types of gene therapy. Another condition 

suitable for such use includes transduction of the low 
density lipoprotein (LDL) receptor gene into hepatocytes 
for the treatment of familial hypercholesterolemia. One 
of skill in the art can generate any number of rAAV which 
25 can be used via the above methods for the treatment of 
these and other disorders. 

For ex vivo or for in vivo therapy, the rAAV 
may be used to infect the target cells by suspending the 
virus particles in a biologically compatible solution or 
30 pharmaceutically acceptable delivery vehicle. A suitable 
vehicle includes sterile saline. Other aqueous and non— 
aqueous isotonic sterile injection solutions and aqueous 
and non-aqueous sterile suspensions known to be 
pharmaceutically acceptable carriers and well known to 
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those of skill in the art may be employed for this 
purpose. 

The rAAV are administered in sufficient amounts 
to transfect the desired cells and provide sufficient 
5 levels of expression of the selected transgene to provide 
a therapeutic benefit without undue adverse, or with 
medicairY^acceptable,“ physiological ef fects which can be 
determined by those skilled in the medical arts. 
Conventional and pharmaceutically acceptable routes of in 
10 vivo administration include direct delivery to the target 
organ, tissue or site, intranasal, intravenous, 
intramuscular, subcutaneous, intra*3ermal , oral and other 
parental routes of administration. Routes of 
administration may be combined, if desired. 

15 Dosages of the rAAV for the infecting step of 

the method will depend primarily on factors such as the 
therapeutic environment, i.e., ex vivo or in vivo ; the 
condition being treated, the selected gene, the age, 
weight and health of the patient, and may thus vary among 
20 patients. A therapeutically effective dosage of the rAAV 
for ex vivo treatment will be based upon the multiplicity 
of infection, which is likely to range from between about 
1 to about 10 transducing particles/cell . A 
therapeutically effective human dosage of the rAAV for in 
25 vivo infection according to the present invention is 

believed to be in the range of from about 20 to about 50 
ml of saline solution containing concentrations of from 
about 1 x 10 7 to 1 x 10 10 transducing viral particles/ml 
virus. A preferred human dosage is about 20 ml saline 
solution at the above concentrations. The dosage will be 
adjusted to balance the therapeutic benefit against any 
side effects. The levels of expression of the selected 
gene can be monitored to determine the selection, 
adjustment or frequency of dosage administration. 
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The effective amount of the facilitating agent 
to be administered is within the skill of the art to 
determine and will depend upon the identity of the agent. 
Known dosages of certain of the classes of chemicals and 
5 pharmaceuticals described above may be employed in this 
method to deunage the DNA and facilitate ss to ds 

conversion of the rAAV. — Where the agent is a gene 

expressed by a helper virus, the amounts of infecting 
virus should be similar to those amounts described above 
10 for the rAAV. Of course, where the agent is a gene 
present in a second generation rAAV, the identical 
dosages described above for the rAAV will apply. 

Several embodiments of the above-described 
methods of this invention were confirmed in murine models 
IS of rAAV mediated gene transfer to both lung and liver. 
These experiments demonstrated similarly low levels of 
gene transfer an vivo by rAAV, which was increased 
several orders of magnitude by coinfection with £1 and E4 
expressing adenovirus. 

20 In summary, experiments were conducted to 

demonstrate that adenovirus enhances rAAV transduction in 
cultured cells. During the production and 
characterization of a lacZ recombinant AAV generated in 
293 cells that were coinfected with an El deleted virus, 
25 it was observed that purification of rAAV from lysates 

was associated with substantial loss of lacZ transducing 
activity when assayed on 293 cells. This drop in rAAV 
activity was particularly evident in the final step where 
residual contaminating helper adenovirus was removed by 
heat inactivation. LacZ transducing activity was 
recovered by adding adenovirus back to the purified stock 
of rAAV. These data provided the first indication that 
adenovirus could substantially enhance the transduction 
efficiency of rAAV. 
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As described in Example 10, a series of 
complementation groups were generated by mixing different 
adenovirus early gene mutants with purified LacZ rAAV, 
referred to as AV.CMVZ.ac2 (see Example 2). These defined 
5 mixtures of viruses were analyzed for LacZ transduction 
on Hela cells (See Examples 12 and 13) . An El deletion 

rAd H5.CBALP andtheE4 deletion mutant dll004provided 

no significant increase in AV.CMVlacZ transduction (Fig. 
5A) . However, partial activity could be achieved with El 
10 and E4 mutants that carried less severe deletions. Both 
dlllO (E1B— 55kDa deleted) and dllOlO (0RF6 deleted) 
enhanced transduction to levels that approached those of 
Ad5, tsl25 , and dl802 in terms of the number of positive 
blue cells, but total B-galactosidase activity was 
15 substantially lower (Fig. 5A) . These results implicate 
early regions El and E4 in the augmentation of rAAV 
transduction. 

The experiments described below also 
demonstrate that the novel rAAV which incorporates as its 
20 conversion gene, an Ad gene, such as E4, can increase 

transduction efficiency of the rAAV in the absence of a 
helper virus. As described in more detail below in 
Example 15 below, 293 cells were stably transfected with 
a genomic fragment of Ad5 spanning E4. This E1/E4 
25 expressing cell line and the parent El expressing cell 
line (293) were infected with rAAV and analyzed for 
transduction. These experiments demonstrated the 
significance of the combined expression of El and 
E4(0RF6) in the adenovirus mediated augmentation of rAAV 
30 transduction. 

In the presence of El and E4 expression, rAAV 
transduction was invariably accompanied by the appearance 
of ds RF monomers and dimers (Example 14) . Importantly, 
the tight correlation between rAAV vector transduction 
35 and the accumulation of duplex forms could be achieved in 
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two different experimental settings; cells infected with 
E1/E4 expressing adenovirus (Figs. 8A and 8B) , or 
complementing cell lines (Fig. 8C) . 

The following examples illustrate the 
5 construction and testing of the novel packaging cell 

lines, the E1/E4 deleted rAd of the present invention and 

the-use— thereof ,—improved-methods_and_second_generation 

recombinant AAV production for gene therapy of the 
present invention. These examples are illustrative only, 
10 and do not limit the scope of the present invention. 

Example 1 - Novel Ela/Elb a nd E4 Expressing Packaging 

gen Lines 

A. Construction of E4 OKF 6 Expressing Plasmids 
The entire E4 region from Ad5 or an ORF6 
15 minigene were subcloned into a shuttle plasmid that 

contained a neomycin resistance gene. Two versions of 
ORF6 minigene were developed that differed in the 
promoter element. The first used a Zn+2 inducible sheep 
metallothionine (MT) promoter to drive ORF 6 expression. 

20 The second used a dexamethasone-inducible mouse mammary 
tumor virus (MMTV) promoter. 

An exemplary plasmid useful for the 
construction of a packaging cell line of this invention 
is pMMTVE40RF6. The minigene contained in this plasmid 
25 is set out in SEQ ID NO: 1, and contains a mouse mammary 
tumor virus promoter (MMTV) (nucleotides 1—1506 of SEQ ID 
NO:l) in transcriptional control of a human E4 ORF 6 gene 
sequence (nucleotides 1523-2408 of SEQ ID NO: 1) , a 
growth hormone terminator (GH) (nucleotides 2409—3654 of 
30 SEQ ID NO: 1) , an SV40 origin of replication, plasmid 
sequences from plasmid pBR322, including a neomycin 
resistance gene, and an ampicillin resistance gene. The 
amino acid sequence of ORF 6 is indicated in SEQ ID NO: 

2. The various functional fragments of this plasmid may 
be readily replaced with other conventionally used 
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sequences and are not critical to the design of the 
plasmid. 

Another plasmid useful for the construction of 
a packaging cell line of this invention is pMTE40RF6. 

5 The DNA sequence of the minigene contained in this 

plasmid is similar to that of SEQ ID NO: 1, except that 

the promoter is a sheep metallothionine promoter — (MT 

promoter) [M. G. Peterson et al, cited above]. 

A plasmid used as a control for the 
10 construction of a packaging cell line of this invention 
is pI/TR. E4 (-) . This plasmid contains the endogenous 
constitutive retroviral MLV LTR and most of the Ad E4 
gene region except that the endogenous E4 promoter and a 
portion of E4 ORFl are missing. The other plasmid 
15 sequences remain the same as described above. 

Still another plasmid useful for the study of 
the methods of this invention is pLTR.E4 , which contains 
the constitutive MLV LTR and endogenous E4 promoter and 
an intact E4 gene. The other plasmid sequences remain 
20 the same as described above. 

To determine whether ORF6 expression was 
sufficient to enhance rAAV transduction, the inducible 
metal lothionein (MT)-ORF6 minigene was stably transfected 
into HeLa cells. This new cell line, HeLa (MT— ORF6 ) was 
25 evaluated for LacZ rAAV transduction in response to ORF6 
induction as described below. The cell line 293 
(MT-ORF6) expresses ORF-6 of the E4 gene of Ad5 from the 
metallothionine promoter which is relatively inactive at 
baseline but can be induced with divalent cations. These 
30 293 cells were included to establish the baseline 

transduction efficiency* 

B* Transfections and Selection of Clones 

Each of the above— described plasmids was 
transfected by the calcium phosphate precipitation 
technique into the human embryonic kidney cell line 293 
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[ATCC CRL1573 ] which expresses the product of the 
adenovirus El genes, or into HeLa cells, seeded on loo mm 
plates (10 pg plasmid/ plate) . Twenty four hours post- 
transfection, cells were harvested and seeded at varying 
5 dilutions (1:10 - 1:100) in 100 mm plates for about 10 
days. Seeding media contain G418 (Geneticin, BRL) at 1 

— mg/ml. — Resistant-colonies— that_developed_were_selected 

using the following assays and expanded. Preliminary 
analysis of clones was based on enhanced transduction 
10 efficiency of a recombinant adeno-associated virus, 

AV . CMVXacZ, and immunofluorescence localization of Ad E4 
protein as described in the following examples. 

Example 2 ~ Recombinant AAV and AV.CMBZ.acg Transduction 
Enhancement Assay 

15 El and E4 Ad gene products are needed for 

recombinant adeno— associated virus (AAV) function. This 
primary assay involves seeding the packaging cell lines 
of Example 1 in 96 well 35 mm culture plates (2x10* 
cells/well) and infecting the cells with purified, heat- 
20 treated AV.CM VLacZ at an MOI of 1000 virus 
particles/cell. 

A. Preparation of Recombinant AV .CMVLacZ 

A recombinant AAV virus was prepared by 
conventional genetic engineering techniques for the 
25 purposes of this experiment. Recombinant AAV was 

generated by plasmid transfections in the presence of 
helper adenovirus [Samulski et al, J. Virol. . 63:3822- 
3828 (1989)]. A cis-acting plasmid pAV . CMVLacZ [SEQ ID 
NO: 4] (see Fig. 10) was derived from psub201 [Samulski 
30 et al, J. Virol. . jy.: 3096-3101 (1987)] and contains an E. 
coli B— galactosidase minigene in place of AAV Rep and Cap 
genes. The 5' to 3* organization of the recombinant 
AV . CMVLacZ genome (4.9 kb) [SEQ ID NO: 4] includes 
(a) the 5* AAV ITR (bp 1-173) was 
obtained by PCR using pAV2 [C. A. Laughlin et al. Gene . 
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23 i 65-73 (19B3) ] as template [nuc. 53-219] ; 

(b) a CMV immediate early 
enhancer/promoter [Boshart et al, Cell . £1:521-530 
(1985) ] (nuc. 246-839) ; 

5 (c) an SV40 intron (nuc. 856-987) ; 

(d) E . coll 8-galactosidase cDNA (nuc. 

1039-4512) ; 

(e) an SV40 polyadenylation signal (a 237 
Bam HI -Bell restriction fragment containing the 

10 Cleavage/poly-A signals from both the early and late 
transcription units (nuc. 4522-4719) and 

(f) 3* AAV ITR, obtained from pAV2 as a 
SnaBI-Bglll fragment (nuc. 4759-4925) . All other 
nucleotides are plasmid derived. 

15 Rep and Cap genes were provided by a trans- 

acting plasmid pAAV/Ad [Samulski et al , cited above]. 

Monolayers of 293 cells grown to 90* confluency 
in 150 mm culture dishes (5xl0 7 cells/plate) were 
infected with H5.CBALP at an MOI of 10. H5.CBALP (also 

20 called H5.010ALP) is a rAd that contains an alkaline 

phosphatase minigene in place of adenovirus Ela and Elb 
gene sequences (map unit6 1-9.2 of the Ad5 sequence of 
GenBank [Accession No. M73260]). The alkaline 
phosphatase cDNA is under the transcriptional control of 

25 a CMV— enhanced B-actin promoter in this virus. This 

helper virus is described in Goldman et al , Hum. Gang 

Ther. . 6:839-851 (July, 1995); Engelhardt et al, flym. 

Gene Ther. . 5:1217-1229 (October, 1994); and references 
cited therein. 

30 Infections were done in Dulbecco's Modified 

Eagles Media (DMEM) supplemented with 2* fetal bovine 
serum (FBS) at 20 ml media/150 mm plate. Two hours post- 
infection, 50 pg plasmid DNA (37.5 pg trans-acting and 
12.5 pg cis— acting) in 2.5 ml of transfection cocktail 
was added to each plate and evenly distributed. 
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Transfections were calcium phosphate based as described 
[B. Cullen, Meth. Enzvmol. . 152 : 684-704 (1987)]. Cells 
were left in this condition for 10-14 hours after which 
the infection/ transfection media was replaced with 20 ml 
5 fresh DMEM/2% FBS. Forty to fifty hours post- 

transfection, cells were harvested, suspended in 10 mM 

Tris-Cl (pH 8.0)buffer (0.5 ml/150 mm plate) and a 

lysate prepared by sonication. The lysate was brought to 
10 mM manganese chloride, after which bovine pancreatic 
10 DNase I (20,000 units) and RNase (0.2 mg/ml final 

concentration) were added, and the reaction incubated at 
37 °C for 30 minutes. Sodium deoxycholate was added to a 
final concentration of 1% and incubated at 37°C for am 
additional 10 minutes. 

15 The treated lysate was chilled on ice for 10 

minutes and solid CsCl added to a final density of 1.3 
g/ml. The lysate was brought to a final volume of 60 ml 
with 1.3 g/ml CsCl solution in 10 mM Tris-Cl (pH 8.0) and 
divided into three equal aliquots. Each 20 ml sample was 
20 layered onto a CsCl step gradient composed of two 9.0 ml 
tiers with densities 1.45 g/ml and 1.60 g/ml. 

Centrifugation was performed at 25,000 rpm in a 
Beckman SW-28 rotor for 24 hours at 4°C. One ml 
fractions were collected from the bottom of the tube and 
25 analyzed on 293 or 293 (E4) cells for LacZ transduction. 

Fractions containing peak titers of functional AV . CKVLacZ 
virus were combined and subjected to three sequential 
rounds of equilibrium sedimentation in CsCl. Rotor 
selection included a Beckman NVT-90 (80,000 rpm for 4 
30 hours) and SW-41 (35,000 rpm for 20 hours). At 

equilibrium, AV.CM VLacZ appeared as an opalescent band at 
1.40-1.41 g/ml CsCl. Densities were calculated from 
refractive index measurements. Purified vector was 
exchanged to 20 mM HEPES buffer (pH7.8) containing 150 mM 
Nacl (HBS) by dialysis and stored frozen at -80 °C in the 
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presence of 10% glycerol or as a liquid stock at -20 °C in 
HBS/40% glycerol • 

Purified virus was tested for contaminating 
H5 «CBAIjP helper virus and AV.CMVXacZ titers. Helper 
5 virus was monitored by histochemical staining for 

reporter alkaline phosphatase activity. A sample of 

pur ified— virus— representing— 1.0 %— of — the— f ina 1 — product .was 

added to a growing monolayer of 293 cells seeded in a 60 
mm plate. Forty-eight hours later, cells were fixed in 
10 0.5% glut ar aldehyde/phosphate buffered saline (PBS) for 

10 minutes at room temperature, washed in PBS (3x10 
minutes) and incubated at 65 °C for 40 minutes to 
inactivate endogenous alkaline phosphatase activity. The 
monolayer was allowed to cool to room temperature, rinsed 
15 once briefly in 100 mM Tris-Cl (pH9.5)/100 mM NaCl/5mM 
MgCl, and incubated at 37°C for 30 minutes in the same 
buffer containing 0.33 mg/ml nitroblue tetrazolium 
chloride (NBT) and 0.165 mg/ml 5-bromo-4-chloro-3- 
indolylphosphate p-toluidine salt (BCIP) . Color 
20 development was stopped by washing the monolayer in 10 mM 
Tris-Cl (pH 8.0) /5 mM EDTA. Routinely the purif ication 
scheme described above removed all detectable H5.CBALP 
helper virus by the third round of buoyant density 
ultracentrifugation . 

25 AV.CM VLacZ titers were measured according to 

genome copy number (virus particles/ml) , absorbance at 
260 nn (A^o particles/ml) and LacZ Forming Units 
(LFU/ml) . Virus particle concentrations were based on 
Southern blotting. Briefly, a sample of purified 
30 AV . GMVLacZ was treated with capsid digestion buffer (50 
mM Tris-Cl, pH 8. 0/1.0 mM EDTA, pH 8. 0/0. 5% 

SDS/ Proteinase K 1.0 mg/ml) at 50°C for one hour to 
release virus DNA. The reactions were allowed to cool to 
room temperature, loading dye was added and 
35 electrophoresed through a 1.2% agarose gel. standard 
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quantities of ds AV.CM VLacZ genome were also resolved on 
the gel. 

DNAs were electroblotted onto a nylon membrane, 
hybridized with a W P random primer labeled restriction 
5 fragment, and the resulting blot scanned on a 

Phosphor Imager 445 SI (Molecular Dynamics) . A standard 

curve— was— generated— from— the-duplex—forms-and— used— to- 

extrapolate the number of virus genomes in the sample . 

LFU titers were generated by infecting indicator cells 
10 with limiting dilutions of virus sample . Indicator cells 
included HeLa and 293 and 293 (E4) lines (described in 
Example 10 below) . Twenty-four hours later, cells were 
fixed in glutaraldehyde and cells were histochemically 
stained for E. coll j3-galactosidase (lacZ) activity as 
15 described in J. M. Wilson et al, Proc. Natl. Acad. Sci. 
USA . £5:3014-3018 (1988). One LFU is described as the 
quantity of virus that is sufficient to cause visually 
detectable fi-galactosidase expression in one cell 24 
hours post-infection . 

20 B. Induction of ORF6 Expression 

Induction of ORF6 expression with 10 pK 
dexamethasone or 150/iM zinc sulfate (for negative 
control, no inducer used) was initiated 2 hours before 
the addition of virus and continued throughout the 
25 duration of the experiment. Twenty-four hours after the 
addition of virus, cells were harvested, lysates were 
generated by sonication and analyzed for the 6— 
galactosidase expression (i.e., fi-galactosidase activity) 
and virus DNA as described above. Hirt extracts were 
30 prepared from low molecular weight DNA from cell 

extracts. The preparation of the Hirt extracts and 
subsequent analysis by Southern hybridization were 
performed by resort to conventional procedures known to 
one of skill in the art. 

In the absence of the inducers, the packaging 
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cell lines generate lower levels of B— galactosidase in 
rAAV infected cells. Induction of ORF6 expression with 
the inducer dexamethasone results in a concomitant rise 
in KV.CMVLacZ cell transduction to a level that was much 
5 greater than the parent 293 line. Expression of El alone 
was insufficient to have an effect in the adenovirus 

— mediated- augmentation-of_r AAV— transduct ion . 

Results are demonstrated for certain positive 
clones in the Table I below (see Example 4). However, 

10 for 30 cell lines having an MMTV promoter and ORF6 

sequence, 4 demonstrated over 90% blue cells illustrative 
of LacZ production in the presence of dexamethasone, 
i.e., 293-27-6, 293-27-17, 293-27-18 and 293-27-28. 
Example 3 - Timnunof luorescence Loc alization of Ad5 Late 
15 Protein. 

Positive clones from the assay of Example 2 were 
infected with the recombinant E4 deleted adenovirus 
H5dll004 and screened for E4 complementation using an 
immunofluorescence assay for late gene expression. The 
20 H5dll004 virus was obtained from Dr. Ketner of Johns 

Hopkins University and is described in Bridge and Ketner, 
J. Virol. . 63 2 f 2 1 : 631—638 (Feb. 1989), incorporated by 
reference herein. Because ORF6 of E4 complements late Ad 
gene expression, specifically in the formation of the 
25 hexon and penton fibers of the adenovirus, cell lines 
containing ORF6 are able to bind with antibody against 
these proteins. 

Each cell line of Example 1 is infected with E4 
deleted virus H5dll004 virus at an MOI of 0.1. The cells 
30 were treated with mouse anti— adenovirus FITC— labeled 

monoclonal antibody to either the hexon or penton fibers 
in a 1:10 dilution (Chemicon International Inc., 

Temecula, CA) . Positive clones were identified by 
reaction with the antibody. 
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Example 4 - Relative Plaguing Efficiency 

The cell lines of Example 1 demonstrating with 
strong complementation ability in Example 3 were screened 
for relative plaquing efficiency of H5dll004 as compared 
5 to W162 cells (an E4 -complementing Vero cell line which 
does not express El) [Weinberg and Ketner , Proc. Natl. 

— Acad — Sci — USA — 80(171:5383-5386 (1983) 1 In_Table_II 

below, RPE%, i.e., relative plaquing efficiency, 
represents the titer of H5dll004 on tested cell 
10 , lines/titer of H5dll004 on W162 cells. For example, the 

KPE of 293 cells is 0. 

The positive cell lines selected by all criteria are 
identified in Table I below, with the results of the 
assays of Examples 2, 3 and 4. 

15 TABLE I 



E1/E4 Double Complementing Cell Lines 



Cell 

bine 


Trans- 
Sene 


Pro- 

moter 


XF/LP 


AV.CMV 

I^acZ 


BEES. 


293-10-3 


ORF6 


MT 


+ +++ 


++++ 


246 


293-39-11 


ORF6 


LTR 


++++ 


+++ 


52 


293-84-31 


E4- 


I/TR 


++++ 


++++ 


179 


293-12-31 


vhole 

E4 


IiTR 

+E4 


++++ 


++++ 


174 


293-27-6 


ORF6 


MMTV 




+++++ 


327 


293-27-17 


ORF6 


MMTV 




++++ 


313 


293-27-18 


ORF6 


MMTV 




+++-H- 


339 


293-27-28 


0RF6 


MMTV 




++++ 


261 



Example 5 - Construction and Purif ication of H5.001CBLacZ 
30 The plasmid pAd.CB LacZ was constructed as described 

in detail in K. Kozarsky et al, Som. Cell Mol . Genet.. 
12(5) : 449-458 (1993), incorporated by reference herein. 
This plasmid contained a minigene comprising a 5' 
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flanking Nhel restriction site, followed by Ad5 sequence 
m.u. 0*1, followed by an El deletion into which is 
inserted a CKV enhancer/ chicken 6-actin promoter sequence 
[T. A. Kost et al, Wucl. Acids Res. . 11(23) ;8287 (1983)], 

5 which controls the transcription of the following 

bacterial B-galactosidase, followed by a poly A sequence 

andflanked3'by Adm.u.9-16, and another Nhel_ site. 

In the plasmid, the minigene was flanked on both sides by 
plasmid sequence containing drug resistance markers. 

10 The plasmid pAd.CBLacZ was linearized with Nhel and 

co- transfected by the calcium phosphate co- transfect ion 
method into the novel packaging cell line of Example 1 
with Clal digested H5dll004 (an Ad5 sequence deleted of 
from about map unit 92.1 through map unit 98, 

15 corresponding to substantially the entire E4 gene) . 

Homologous recombination occurs in the cell line 
between these two viral constructs between Ad map units 
9-16, resulting in rAd, designated H5.001CB La.cZ [SEQ ID 
NO: 3] (Fig. 2). This rAd contains the sequence from 
20 pAd . CBLacZ (including Ad map units 0-1 (nuc. 1-330); CMV 

enhancer/ chicken B-actin promoter (CB) (nucs. 370-928) ; 

E . coll B-galactosidase (nucs. 945-4429); the polyA (nuc. 
4429-4628); and Ad5 map units 9-92.1 and 97.3 to 100 from 
H5dll004 (nucs. 4671-35408)). This rAd is thereby 
25 functionally deleted, and substantially structurally 
deleted, of the Ad El and E4 genes. 

Viral plaques were selected and screened by the B— 
galactosidase assay [Wilson (1988) , cited above] and 
H5.001CB LacZ was isolated following three rounds of 
plaque purification. The purified virus was also 
subjected to cesium chloride density centrifugation and 
large scale production. 

For the following mouse experiments, virus was used 
after column purification and glycerol was added to a 



30 




WO 96/39530 



PCT/US96/1 0245 



46 

final concentration of 10% (v/v) . Virus was stored at - 
70°C until use. 

Example 6 - Growth Kinetics of H5 . OOlCBLacZ in Packaging 
Cell Jj in .es 

5 The cell lines identified in Table I were infected 

with recombinant H5.001CB LacZ at an MOI of .5. The 

growth kinetics of this virus in t he E4 complementing 

cell lines are shown in Fig. 3. Maximum viral yield is 
reported as LFU/ml in Table XI below. 



10 


Cell Line 


TABLE II 

Maximum Viral Yield 




293-10-3 


2.8X10 10 




293-39-11 


9.5X10® 




293-84-31 


1.1X10 9 


15 


293-12-31 


4.5X10* 




293-27-6 


2.8X10 10 




293-27-17 


2 . 5xl0 10 




293-27-18 


2 . 9X10 10 




293-27-28 


1.2X10 10 


20 


When grown in 293 


-27-18 cells (the E4 ORF6 



with MMTV promoter inducible by dexamethasone) the 
maximum yield of this virus is 2.9X10 10 LFU/ml. Several . 
of the cell lines were passaged between 5 and 20 times 
and the viral production of the passages remained stable. 
25 However, RPE did fall following repeated passages of 
cells. 

Example 7 - Other Recombinant Adenoviruses 

Other related rAds were prepared similarly to 
H5. OOlCBLacZ by homologous recombination between 
30 pAdCBLacZ and other helper viruses. 

As one example, H5 . OOOCBLacZ is a recombinant El 
deleted Ad5 which contains the same minigene as 
H5 . OOlCBLacZ, but has an intact E4 gene. This rAd was 
prepared as described by homologous recombination between 
pAdCB LacZ and a wild-type Ad5. 
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As another example, H5. OlOCBLacZ contains the 
adenovirus map units 0-1, followed by a CMV enhanced, 
chicken cytoplasmic jS-actin promoter, the E . coli B- 
galactosidase gene (lacZ) , a polyadenylation signal (pA) , 
5 and adenovirus type 5 map units 9-100, with a small 

deletion in the E3 gene (the Ad 5 sub360 backbone) . This 

rAd-may_be_prepared__by_homologous recombin at ion_be tween 

the pAdCB LacZ vector and Ad5 virus sub360, which contains 
a 150 bp deletion within the 14.6 kD protein of the E3 
10 gene. See, e.g., J. F. Engelhardt et al, Proc. _Natl . 

Acad. Sci . . USA . 91:6196-6200 (June 1994); and Engelhardt 
et al, Hum. Gene Ther. . 5:1217-1229 (Oct. 1994), both 
incorporated by reference herein. 

These rAds were isolated following transfection 
15 [Graham, Virol. , 52:456-467 (1974)], and were subjected 
to two rounds of plaque purification. Lysates were 
purified by cesium chloride density centrifugation as 
previously described [Englehardt et al, Proc. Natl. Acad. 
Sci. USA . £8: 11192-11196 (1991)]. Cesium chloride was 
20 removed by passing the virus over a BioRad DG10 column 
using phosphate-buffered saline. 

Example 8 - LacZ Gene Transfer into House 

A. Transfer into Mouse Muscle 

Five to six-week old male C57B/6 mice were 
25 anesthetized. Anterior tibialis muscles were exposed and 
directly injected with either rAd H5.000CB LacZ, 

H5.010CB LacZ or H5.001CB LacZ as follows: 25 ;iL of 

purified viral suspension at a stock concentration of 5 x 
10 n virus particles/mL was injected by inserting the tip 
30 of the 33 gauge needle of a 100 /iL Hamilton syringe into 
the belly of the muscle. 

Animals were sacrificed on day 4, 14, 28 and 60 
post injection. The muscles were dissected and frozen in 
liquid nitrogen cooled isopentane. Six /xM sections were 
35 cut in a cryostat, fixed and stained for B-galactosidase 
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activity for 6 hours at 37 °C. 

While the blue stained rAd was found for each 
virus in the day 4 and day 14 (most abundant) stains , by 
day 28, the H5.001CB LacZ clearly demonstrated more virus 
5 on day 28. By day 60, the only virus which stained 
positive was the H5.001CB LacZ. 

B. Transfer into Mouse Lung and Circulation 

RAd H5.000CB LacZ (control), and H5.001CB LacZ 
(IxlO 11 viral particles) were administered to six week old 
10 C57BL/6 female mice by tail vein injection and trachea 

installation. The animals were sacrificed and their 
liver and lung tissues were harvested at days 4, 9, 21, 

28 and 35 post-administration. The transgene and viral 
late gene expression were compared. 

15 At therapeutic doses of virus, there was 

diminished expression of late viral proteins at all time 
points in comparison with transgene. 

C. Dose Responses In Liver 

Dose responses of E4-deleted and E4 intact rAds 
20 in the liver of C57BI./6 mice were studied by tail vein 
administration of 1.5X10 11 , 5xl0 10 , 1.7X10 10 , 5.6xl0 9 , and 
1.9xl0 9 viral particles and comparing the transgene and 
viral late gene expression at day 4, 21, 28, 35, and 42 
post administration. 

25 At therapeutic doses of virus, there was 

diminished expression of late viral proteins at all time 
points in comparison with transgene. 

Example 9 - Other Gene Transfers 
A. Human OTC Gene Transfer 
30 The human OTC gene [A. L. Horwich et al. 

Science . 224 : 1068-174 (1984)] or the human CFTR gene 
[Riordan et al. Science . 245 :1066-1073 (1989)] was used 
to replace the LacZ as the transgene in the recombinant 
E1/E4 deleted adenoviruses described above, using the 
techniques analogous for the construction of the above- 
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described LacZ vectors. 

The resulting human OTC-containing rAd were 
administered at an MOI of 10 to 30 to human hepatocytes. 
The E1/E4 deleted rAd demonstrated less replication and 
5 less late gene expression than when the E1/E4 deleted 
rAds are administered to muscle, as described in the 

example above However, the results of this gene 

transfer are better than comparable transfers with rAds 
containing only a deletion in the El gene or a deletion 
10 in the El gene and a point mutation in the E2a gene. 

Similar results are demonstrated when the 
transgene is CFTR and the method of administration is 
intratracheal into lungs. 

Example 10 - Transduction Efficiency of rAAV LacZ 
15 AV.CM VLacZ^ in HeLa Cells Infected with Ad Mutants 

A. Viruses 

The following viruses were employed in this 
experiment: 

(1) Wild-type Ad 5, propagated in 293 cells; 

20 (2) Ad dlllO (an Ad which is deleted of the 55 

kb E1B gene) [Babiss et al, J. Virol. . 52(2) :389-395 
(1984) and Babiss and Ginsberg, J. Virol. . !50(1) : 202-212 
(1984)], propagated in 293 cells, 

(3) H5.CBALP (an Ad deleted of its E1A and E1B 
25 genes and containing a minigene that expresses alkaline 

phosphatase from a CMV enhanced /3-actin promoter, as 
described above) , propagated in 293 cells, 

(4) Ad tsl25 (an Ad with a temperature 
sensitive mutation in the E2A gene which encodes the DNA 

30 binding protein) [Ensinger and Ginsberg, J . Virol . . 

10(3) :328-339 (1972)], propagated in 293 cells, 

(5) Ad dl802 (an Ad deleted of its E2a gene) , 
grown in E2A-complementing gmDBP cells as described in 
Rice and Klessig, J. Virol. ♦ jj<; (3) : 767-778 (1985); 
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(6) Ad dll004 (an Ad deleted of the E4 gene) , 
grown in E4 -complementing Vero W162 cells [Weinberg and 
Ketner, Proc. Natl. Acad. Sci. USA . 8£.(17) :5383-5386 
(1983)] and 

5 (7) Ad dlioio (an Ad deleted of ORF6 of its E4 

gene) , grown in E 4 -comp 1 ement ing Vero W162 cells 

[Weinberg -and -Ketner,— cited-above]-. 

All viruses were purified by two sequential 
rounds of buoyant density ultracentrifugation in CsCl. 

10 B. Experimental Procedures 

HeLa cells seeded in 6 well, 36mm culture 
plates (2x10 s cells/well) were infected with wild-type 
Ad5 or an adenovirus early gene mutant as described in 
Part A at an MOI of lOpfu/well. Infections were done in 
15 l.o ml DMEM/2%FBS. six hours post-infection, monolayers 
were washed and 1.0 ml fresh DMEM/2% FBS media containing 
AV.CM VLacZ at 4x10* virus particles /ml were added. 
Although the AV.CM VLacZ virus lot used in these 
experiments was shown to be free of H5 . C L ALP helper virus 
20 by histochemical staining, the virus sample was subjected 
to heat treatment (60°C for 20 minutes) prior to use to 
ensure the absence of contaminating adenovirus. Two 
hours later, 1.0 ml of DMEM/115% FBS was added to each 
well. 

25 Twenty-four hours after the addition of 

AV.CM VLacZ, cells were harvested. Each test condition 
was done in triplicate to enable virus transduction to be 
evaluated in terms of three outputs: histochemical 

staining for B-galactosidase activity (below) , 

30 intracellular B-galactosidase specific activity (Example 

11) , and the molecular form of the virus DNA (Example 

12 ) . 

HeLa cells were histochemically stained for E . 
coli /9-galactosidase (LacZ) activity as described in J . 

M. Wilson et al, Proc. Natl. Acad. Sci. USA . 85:3014-3018 
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(1988) . The different combinations that were tested 
included cells transfected with AAV vector alone 
(AV. CMVLacX) , vector plus wild-type Ad5 (+Ad5) , vector 
plus dlllO (-t-dlllO) , vector plus Ad mutant H5.CBALP 
5 (+H5 • CBALP) , vector plus Ad mutant ts!25 (+ts!25) , vector 

plus Ad mutant dl802 (+dl802) , vector plus Ad mutant 

dll004(+dll004), and -vector plus Ad-mutant dllOlO 

(+dll010) . 

The results were observed in photomicrographs 
10 at magnification 10X (not pictured) of histochemical 
stains for recombinant B-galactosidase activity. The 
results indicated that wild-type Ad5 and the E2a mutants 
tsl25 and dl802 caused a significant increase in LacZ 
rAAV transduction as measured by the number of positive 
15 blue cells and the degree of stain intensity. Both dlllO 
(E1B— 55kDa) and dllOlO (ORF6) enhanced transduction to 
levels that approached those of Ad5, tsl25, and dl802 in 
terms of the number of positive blue cells. 

The El deletion recombinant H5. CBALP provided 
20 no significant increase in AV.CMVLacZ transduction. 

Expression of El alone was insufficient to have an effect 
in the adenovirus mediated augmentation of rAAV 
transduction as evidenced by lack of significant increase 
in transduction obtained with HeLa cells infected with 
25 the E4 deletion mutant dll004. A significant drop in 

transduction occurred following removal of ORF6 from the 
E4 region from the coinfecting adenovirus (Fig. 5A) . 

It is believed that these results demonstrate 
that the adenoviral gene products , E4 and El indirectly 
30 promote the formation of ds DNA intermediates that are 
transcriptionally active. 

fiyaTnp] li - Quantitation of Enhan ced Vector Transduction 
(A) A duplicate set of HeLa cells as described in 
Example 10B were used in this experiment. Twenty-four 
35 hours after the addition of AV.CMVLac# recombinant , for 
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intracellular B-galactosidase assays, cell pellets were 
suspended in 0.5 ml PBS and sonicated. Cell debris was 
removed by centrifugation (15,000Xg for 10 minutes) and 
the clarified extract assayed for total protein [M. 

5 Bradford, Anal. Biochem. . 72(1-2) J248-254 (1976) and M. 
Bradford et al. Fed. Proc. . 2£(3) :274 (1976)] and B- 

galactosidase activity [Sambrook etal,cited above] 

using o-nitrophenyl fi-D-galactopyranoside (ONPG) as 
substrate. 

10 Fig. 5A demonstrates the transduction 

efficiency quantitated by measuring fi-galactosidase 
enzyme activity in the lysates from infected Hela cells 
and also assayed for total protein. In Fig. 5A, the test 
condition is shown along the horizontal axis, and 
15 intracellular B-galactosidase specific activity 

(milliunits/mg protein) using ONPG as substrate is 
plotted on the vertical axis. Below each bar, the 
fold-induction in specific activity relative to cells 
that received the A V.CMVLacZ vector alone is given. 

20 The results of Fig. 5A demonstrate that the E2a 

mutants tsl25 and dl802 produced 134-fold and 225-fold 
increases in B-galactosidase activity, respectively, as 
compared to that achieved with purified rAAV alone. In 
comparison, cells infected with wt Ad5 generated 107 -fold 
25 increase in B-galactosidase activity. 

(B) In another experiment, HeLa cells (2x10*) were 
infected with increasing multiplicities of wild-type Ad5 
or the E2 mutant dl802. Six hours post-infection, 
monolayers were washed and infected with AV.CM VLacZ at 
30 1000 virus particles/cell. Twenty-four hours after the 

addition of AV. CMVLacZ , cells were harvested and assayed 
for total protein and B-galactosidase activity. 

The results are illustrated in the bar graph of 
Fig. 5B, in which adenovirus NOI's are given along the 
horizontal axis, and intracellular B-galactosidase 
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specific activity along the vertical axis. Enhancement 
of rAAV transduction was proportional to input helper 
adenovirus from MOIs of 1 to 50 for both wild type Ad5 
and dl802. Higher doses of virus were cytopathic, 

5 leading to a fall in B-galactosidase expression. 

Enhanced transduction was achieved when the cells were 

infected prior to, or at the-time of, — rAAV infection. 

The El deletion recombinant H5.CBALP and the E4 deletion 
mutant dll004 provided no significant increase in 
10 AV.CMV.LacZ transduction. Both cells infected with dllio 
(E1B— 55kDa) and with dllOlO (ORF6) demonstrated 
substantially lower total fi-galactosidase activity than 
those infected with Ad5, tsl25, or dl802. 

Example 12 - Analysis of Low Mol ecular Weight DNAs_J.n 
15 AV.CMVb acZ Trans duced Cells 

Studies with these early gene mutants of adenovirus 
suggested that expression of adenoviral genes rather than 
the virion itself was responsible for enhancement of rAAV 
transduction. To further investigate these mechanisms 
20 and to determine if conversion of ss to ds genome limits 
the transduction efficiency of rAAV, the molecular state 
of the rAAV genome was characterized in the infected 
cells. The relationship between RFm formation and lacZ 
rAAV transduction was explored in experiments where the 
25 dose of coinfecting virus was varied (MOI*l, 5, or 10). 

(A) A duplicate set of HeLa monolayers as 
described in Example 10 were harvested 24 hours after 
they were transduced with the recombinant AV.CMVLacZ and 
cultured with or without helper adenovirus. 

30 Episomal DNA was extracted from cell pellets 

using a modification of the procedure originally 
described by B. Hirt, J. Mol. Biol. . ££:365-369 (1967) . 
Briefly, cells were suspended in 320 ml Tris-Cl 
(pH 8 .0) /10 mM EDTA and SDS added to a final concentration 
35 of 1%. The mixture was incubated at 37°C for 30 minutes. 




WO 96/39530 



PCT/US96/1 0245 



54 

Pronase and proteinase K were added to final 
concentrations of 500 /ug/ml and 20 pg/ml, respectively, 
and the reaction incubated at 37 °C for 2 hours. Sodium 
chloride was added to a final concentration of 1.1 H and 
5 incubated at 4°C overnight. The precipitate that 
developed during the 4°C incubation was pelleted at 

2 0,0 00— xg— f or-3 0-minut es-and-the-clear—supernatant 

carefully removed. The supernatant was extracted once 
with phenol: chloroform: isoamyl alcohol (25:24:1) followed 
10 by chloroform: isoamyl alcohol (24:1). Nucleic acids were 
precipitated with ethanol. The final pellet was 
suspended in 50 jil Tris-Cl (pH 8.0)/1.0 mM EDTA. 

These Hirt extracts were analyzed by Southern 
blot hybridization. Samples (5 Ml) of each Hirt extract 
15 were resolved through a 1.2% agarose gel, electroblotted 
onto a nylon membrane and hybridized with a 32 P random 
primer labeled cDNA of the SV40 polyA signal used in 
AV.CM VbacZ. 

An autoradiogram of the experiment of Example 
20 12 (not pictured) , identifies and labels bands 

corresponding to the ss AV.CMVLacZ genome (SS) , a monomer 
replicative form (RFm) , and concatomer replicative forms 
(RFd) . Bands corresponding to the ss AV.CMVLacZ genome 
(SS) , a monomer replicative form (RFm) , and concatomer 
25 replicative forms (RFd) were identified and labeled. To 
reference the RFm band, a plasmid carrying AV.CMV.LacZ was 
digested to release the entire genome. Autoradiogram 
exposure times were 14 hours and 69 hours. 

In this autoradiogram, the full spectrum of 
30 molecular species present during a lytic infection was 
demonstrated in cells infected with both La.cZ rAAV and 
wild type adenovirus. Both the input ss genome (SS) and 
monomeric and dimeric forms of ds replicative 
intermediates (RFm and RFd) are present. This contrasts 
with cells infected with purified rAAV alone, where ss 
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genome is the sole molecular form detected. Analysis of 
cells coinfected with the adenovirus early gene mutants 
revealed a direct correlation between formation of ds 
forms of the rAAV genome and the enhancement of LacZ 
5 transduction. Mutant adenoviruses that were ineffective 
in enhancing rAAV transduction (i.e., the El deleted 

mutantr-H5^CBALP-and the-E4-deleted-mutant_dll004)— failed 

to promote the formation of ds forms of AAV. 

Cells infected with adenovirus deleted of E2a 
10 (dl802) or partially deleted of El (dlllO) or E4 (dllOlO) 

additionally demonstrated a band whose size was identical 
to the ds replicative monomer (RFm) of the lacZ rAAV 
genome and whose abundance correlated directly with the 
expression of B-galactosidase activity (compare results 
15 of Example 14 to these described results) . Slower 

migrating concatomers, likely dimers, of duplex rAAV were 
also detected in the autoradiogram described above. 

In the presence of El and E4 expression, rAd 
transduction was invariably accompanied by the appearance 
20 of ds RF monomers and dimers. 

The high molecular weight band in sample lane 
+H5 • CBALP is helper virus DNA. Helper virus DNA is 
recognized by the SV40 probe because the CBALP minigene 
also utilizes the SV40 polyA signal. 

25 (B) In another experiment, HeLa cells were infected 

with vt Ad5 or the E2 deleted mutant dl802 as described 
in Example 10B. Monolayers were harvested 24 hours later 
and analyzed for B-galactosidase activity and RFm 
synthesis. Monomer bands similar to those shown in the 
30 autoradiogram described above were quantitated on a 
Phosphor Imager 445 SI and assigned values (CPM) . 

The results are illustrated in the graphs of 
Figs. 8A and 8B, in which B-galactosidase specific 
activity and CPM are plotted along the vertical axis of 
35 each figure. Adenovirus MOI's are given on the horizontal 
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axis of each figure. Data obtained from low MOI 
infections (l, 5, and 10) are shown. Importantly, the 
tight correlation between rAAV vector transduction and 
the accumulation of duplex forms could be achieved in 
5 cells infected with E1/E4 expressing adenovirus. The 
level of fi-galactosidase and abundance of RFm increased 

in proportion to theamount of inf ectingwildtypeAd 

(Fig. 6A) and dl802 (Fig 6B) . These data suggest that 
synthesis of an episomal duplex intermediate is an 
10 obligatory event in transduction. 

Example 13 ~ Dvrelex End-Analysis 

The following is a description of a model for 
leading strand synthesis of a complementary AAV strand in 
the presence of Rep (+Rap) or absence of Rep (-Rep) . 

15 Refer to Figs. 7A-7F. Rep expresses a terminal 

resolution activity that can convert a duplex structure 
with closed-ends to an open-ended duplex. In the absence 
of Rep, terminal resolution is impaired leaving the 
covalently closed, hairpin structures intact. Under 
20 these conditions, hairpins are expected to be found 

leftward and rightward, since both strands of a rescued 
ds AAV genome are packaged into virions. Figs. 7B-7F are 
a flow chart demonstrating the strategy for identifying 
the terminal structure of duplex RFm that is synthesized 
25 from ss AV.CM VLacZ in response to adenoviral gene 
expression. 

Fig. 7C illustrates a closed end and an open end 
fragment of rAV.C VNLacZ. Figs. 7D, 7E and 7F indicate 
the mixture of open-ended and covalently closed duplex 
30 fragments generated by Notl digestion at position 4509 in 
the absence of terminal resolution. The Notl 4509 
digestion provides a convenient means of releasing a 361 
bp fragment that contains the right ITR in the context of 
a hybridization target (i.e. SV40 pA) . In the presence 
of terminal resolution, only the open-ended 361 bp 
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fragment: would be expected to be generated (Fig. 7D) by 
such digestion. 

The resulting electrophoretic gel (not pictured) , 
revealed in lane (1)' the results of digestion of a 
5 plasmid carrying an AV . CMVIacZ cDNA to release the rAAV 
vector . and subsequent digestion with Notl to release the 

right-terminal-361 -bp-fragment . — In lane (2) a sample of 

Notl digested Hirt DNA extracted from HeLa cells infected 
with wild-type Ad5 and transduced with AV . CM VLacZ 
10 resulted in the release of two fragments, labeled FormI 
and Formll. (See, also. Figs. 8 A and 8B) . The migration 
of ss AV.CMViacZ (SS) and RFm were also seen. 

The ds AV. CMVLacZ intermediates that accumulated in 
cells infected with adenovirus were likely the result of 
15 leading strand DNA synthesis, initiating from the duplex 
region of the vector ITR. In the absence of Rep, this 
conversion event was anticipated to generate molecules in 
which one end is open and the other is covalently closed 
(Fig. 7A) . To further characterize the structure of this 
20 ds intermediate Hirt extracts from cells coinfected with 
rAV . CMBLacZ and Ad5 were digested with Notl to release 
the termini of the ds intermediate which, if left open, 
would be approximately 361 bp in length. The resulting 
filters were hybridized with a probe specific for the 
25 SV40 polyadenylation signal positioned immediately 

upstream of the rightward ITR. At least two forms were 
released from the right end of duplex genomes, one that 
migrated to a position in the gel that predicted an 
open-ended conformation (Form II) , and a second slower 
30 migrating species (Form I) . Although this result was 

consistent with the model (Figs. 7A-7F) , it was difficult 
to predict with certainty the structure of Form I. Its 
retarded mobility did, however, suggest a conformation 
that differed from the open-ended Form II. 
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Sxanplfi .A4. . .r A na l ys is fi l.Ay.» CNV&3CZ .Tgan^flVCt.ifln 

Efficiency in 293 Cells Stablv Transfected with_an 
Inducible E4 ORF6 cPNA 

Cell lines used in this assay were prepared as 
5 described in Example 1. 293 (MT-ORF6) cells and HeLa(MT- 

ORF6) cells were seeded in € well 35 mm culture plates 

(-2x10— cel-ls/wel-1)"— and~infected--with- purif ied,— heat=— — 

treated AV • CMVLacZ at an MOI of 1000 virus 
particles/ cell. Induction of ORF6 expression with from 
10 none to increasing concentrations of zinc sulfate was 
initiated 2 hours before the addition of virus and 
continued throughout the duration of the experiment. 

Twenty-four hours after the addition of virus, cells 
were harvested, lysates were generated by sonication and 
15 analyzed for the 5-galactosidase expression (i.e., 

S-galactosidase activity) and virus DNA as described in 
the preceding examples. Hirt extracts were prepared from 
low molecular weight DNA from cell extracts. The 
preparation of the Hirt extracts and subsequent analysis 
20 by Southern hybridization were performed similarly to 
those described in the examples above. 

The results of this experiment were as follows: 

(1) Specific Activity 

The results are illustrated in the bar 
25 graph of Fig. 8A. Specific activity (milliunits 

5-galactosidase/mg protein) is plotted along the vertical 
axis. Below each bar is given the concentration of zinc 
used for induction, the fold-induction relative to 293 
cells, and the fold- induction relative to 293(ORF6) cells 
30 maintained in the absence of zinc. As shown in Fig. 8A, 
in the absence of Zn+2, the 293(MT-ORF6) cell line 
generated 39-fold higher levels of B-galactosidase in 
rAAV infected 293 cells. Induction of ORF6 expression 
with increasing amounts of Zn +2 resulted in a concomitant 
35 rise in AV.CMVAacZ cell transduction to a level that was 
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445-fold greater than the parent 293 line. Expression of 
El alone was insufficient to have an effect in the 
adenovirus mediated augmentation of rAAV transduction. 

The specific activity of B-galactosidase 
5 was 196.2 mUnits/mg in E1/E4 expressing 293 cells, 

compared to 1.0 mUnit/mg in 293 cells that only expressed 

El-genes^ — These-exper iments— support— a-mechanism— for 

enhancing rAAV transduction that is dependent on the 
combined expression of both El and E4 adenoviral genes. 

10 (2) Molecular Analysis of the AV.CMVJ LacZ 

Genome 

The duplex monomer replicative form (RFm) 
was quantitated and the values (CPM) plotted along the 
vertical axis in the bar graph of Fig. 8B. The 
15 concentration of zinc used for induction and the 

fold- induction relative to 293 (ORF6) cells maintained in 
0 mM zinc is given below each bar. 

An autoradiogram (not pictured) shows the 
agarose gel resolved Hirt extracts from the AV . CMVLacZ 
20 transduced cells described above. A plasmid carrying the 
AV. dWLa.cz cDNA was digested to release the entire 
sequence and loaded in a lane of the autoradiogram. The 
band that appeared in this lane therefore reflected the 
migration of a monomer duplex replicative form (RFm) . 

25 The migration of the ss AV.CM VLa.cZ genome (SS) , RFm, and 
dimers of the duplex replicative form (RFd) were also 
shown. Lanes of the autoradiogram labeled (0), (50), 

(100), (150), (200), and (250) contained samples from 

293 (MT— ORF6) cells that were induced with the indicated 
30 concentration of zinc. A Hirt extract from 293 cells 
(lane labeled 293) transduced with AV.CM VLacZ was also 
shown. 

Analysis of Hirt extracts revealed the 
presence of the RFm in the rAAV infected 293(MT-ORF6) 

35 cells that was not present in similarly infected 293 
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cells. When the induction profiles (Figs. 8A and 8B) 
that describe AV.CMViacZ transduction efficiency were 
compared, the results were plotted in Fig. 8C. Specific 
activity (milliunits B-galactosidase/mg protein) data 
5 from Fig. 8 A and counts-per-minute data (CPM) of 
AV.CM VLacZ RFm from Fig. 8B are plotted along the 

vertical axis, and concentrat ion-of — zinc --sulfate- -used 

during the experiment is shown along the horizontal axis. 

The two profiles are near mirror images. 

10 Importantly, the RFm increased in proportion to the 

increment in lacz transducing activity that occurred as 
ORF-6 expression was induced with Zn +l (Fig. 8C) . Similar 
results were obtained with a 293 derived cell line that 
expresses ORF6 from the glucocorticoid responsive MMTV 
15 promoter. 

Example 15 - Enhanced AV. CMVLacl? Transduction in He La 
Cells Carrying an Inducible ORF6 M lnicrene 

HeLa (MT— 0RF6) cells (2xl0 6 ) were transduced at an 
MOI of 1,000 AV.CKVLacZ recombinant particles/cell in 
20 absence of zinc sulfate inducer or in the presence of 50, 
100, 150, 200, or 250 /xM zinc sulfate inducer in the 
media during transduction. Twenty-four hours later, 
cells were harvested, cell extracts were prepared by 
sonication, and analyzed for transgene expression (i.e., 

25 B-galactosidase activity) . Cell monolayers were 

histochemically stained for B-galactosidase activity. 

The resulting photomicrographs (not pictured) 
illustrated that histochemical staining revealed an 
increase in the number of cells scored lacZ positive as 
30 the concentration of Zn +2 in the medium was raised from 0 
to 200 mM. Concentrations of 250 mM zinc were found to 
be toxic to the cells. 

Specific activity (milliunits B-galactosidase/mg 
protein) is plotted in Fig. 9 along the vertical axis. 
Below each bar is given the concentration of zinc used 
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for induction, -the fold-induction relative to HeLa cells, 
and the fold- induction relative to HeLa ( Mt-ORF 6 ) cells 
maintained in the absence of zinc. Histochemical 
staining revealed an increase in the amount of 
5 fi-galactosidase in lysates as the concentration of Zn+ 3 
in the medium was raised from 0 to 200 mM. 

-Example— 16 Southern— B-lot— Ana-l-vs-i-s— of— Low— Molecular 

Weight DNAs from AV . CMVLacZ Transdu ced HeLa f MT-ORF6 ) 

Cells 

10 Following Induction of E4QRF6 

Hirt extracts were prepared from HeLa (MT— ORF6) cells 
transduced with AV.CMVLa cZ as described in Example 15 in 
the presence of increasing concentrations of Zn +a to 
determine whether synthesis of duplex intermediates 
15 contributed to the augmentation in AV.CMVLacS 
transduction. 

Samples of HeLa (MT— ORF6) cells that were induced 
with a concentration of zinc sulfate (0, 50, 100, 150, 
200, and 250) were resolved on a 1.2% agarose Southern 
20 gel (not pictured) , transferred to a nylon membrane, and 
hybridized with a XacZ-specif ic probe. One lane 
contained a plasmid encoding AV.CM VLacZ that was digested 
to release the entire genome. Bands corresponding to the 
ss AV. aw Lac Z genome (SS) , duplex monomers (RFm) , and 
25 duplex dimers (RFd) were indicated on the gel. 

Southern analysis indicated that Hela and uninduced 
Hela (MT— ORF6 ) cells demonstrated a single band on 
Southern blots which comigrated with the ss genome. 
Induction of ORF-6 resulted in the appearance of 
30 detectable levels of ds monomer but only at higher 

concentrations of Zn+2 . A band comigrating with the RFd 
was present in all cell preparations, the relevance of 
which is unclear since the monomer is a likely precursor 
to the dimer. 




WO 96/39530 



PCT/US96/10245 



62 

Example 17 - Effect of Adenovirus Infection on In Vivo 

AV.CMVI.acg Targetin g Efficiency To Mu rine Liver 

The impact of adenoviral gene expression on rAAV 
transduction in murine liver was studied by sequentially 
5 infusing into the portal vein early gene mutants of 
adenovirus followed by rAAV. 

Balb/c-jnice,—4^-to-6=weeks-old_[ Jackson 

Laboratories, Bar Harbor, Maine] were anesthetized by an 
intraperitoneal injection of ketamine (70 mg/kg) and 
10 xylazine (10 mg/kg) . For liver studies, a 1 cm left 
flank incision was made and the spleen exposed. 
i Samples of purified, heat-treated AV.CMVI.acg in 50 

Ml HBS (lxlO 11 virus particles) were used alone or spiked 
with helper adenovirus containing 2X10 10 particles of 
15 purified dll004, H5.CBALP, or tsl25 in a final volume of 
50 Mi* The dose of adenovirus was sufficient to 
transduce >25% of hepatocytes. The virus mixture was 
injected just beneath the splenic capsule and the abdomen 
was closed with 3-0 vicryl. 

20 Necropsies were performed 3 days post-infusion and 

tissue frozen in O.C.T. embedding compound. Frozen 
sections (6Mm) (LacZ+ALP) were prepared and 
histochemically stained for fi-galactosidase enzyme and 
alkaline phosphatase activity. Sections were 
25 counter stained with neutral red and mounted. 

A fi-galactosidase positive hepatocyte targeted with 
AV. CMVLacZ at magnification 20X was obtained. 
Histochemical analyses of liver tissue harvested 3 days 
after gene transfer demonstrated that administration of 
30 10 u particles of purified rAV. CMVLacZ alone into the 

portal vein was not associated with appreciable gene 
transfer (<0.01% of cells), confirming the inherent 
inefficiency of the rAAV system. 

Preinfusion with E4 deleted virus had no impact on 
35 rAAV transduction in mouse liver, whereas El deleted 
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virus demonstrated a modest increment in lacZ positive 
hepatocytes to about 0.1%. The most significant increase 
in rAAV transduction occurred following infusion of the 
E2a adenovirus mutant tsl25 with lacZ expression detected 
5 in 10-25% of hepatocytes. A direct relationship between 
adenovirus gene expression and rAAV transduction was 

demonstrated in animals infused with both lacZ rAAV and — 

the ALP expressing El deleted virus. The dose of 
adenovirus was reduced 10-fold to minimize the 
10 coincidental occurrence of coinfection. Histochemical 
studies demonstrated co-localization of ALP and B— 
galactosidase in the majority of fi-galactosidase 
expressing hepatocytes. 

Example 18 - Effect of Adenovirus Infection on In Vivo 

15 AV.CMVLacS Targeting Eff iciency T o Murine Lung 

Experiments described in Example 17 for mouse liver 
were adapted for the study of rAAV mediated gene transfer 
to mouse lung. For lung experiments, anesthetized Balb/C 
animals were intubated as described in DeMatteo et al, 

20 Transplantation (Baltimore! . 59.(5) : 787-789 (1995). 

Briefly, a midline 2 cm skin incision was made in the 
neck to expose the trachea. A 2 inch 18 gauge 
angiocatheter was passed through the mouth, positioned in 
the midportion of the trachea, and connected to a rodent 
25 ventilator (#55-3438 Harvard) . Polyethylene (PE#10, 

Intramedic) was fed through the catheter via a side port 
and advanced beyond the tracheal bifurcation. Using a 
Hamilton syringe, virus samples (30pl) were slowly 
infused into the lung through the polyethylene tubing. 

30 Samples contained the same formulation of purified, heat- 
treated AV.CMVLacZ with or without helper adenovirus, as 
described for liver injections. 

Tissue was harvested 72 hours post-infusion. Frozen 
sections were histochemically stained for B-galactosidase 
activity and counterstained with neutral red. 
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Frozen sections from lung (AV. CMViacZ) shoved a B- 
galactosidase positive airway epithelial cell targeted 
with AV.CM VLacZ. similar studies were performed in the 
murine model of lung-directed gene transfer. 

5 Adenoviruses were instilled into the trachea prior to the 
instillation of rAAV. Analysis of lung tissue 3 days 

later revealed only a rare B-ga lactos idase positive cell . 

in animals instilled with rAAV alone. No detectable 
enhancement of rAAV transduction was noted in animals 
10 preinstilled with adenovirus deleted of either El or E4. 
Substantial enhancement of transduction was achieved in 
conducting airway and alveolar cells of animals 
administered the E2a mutant adenovirus. 

These experiments in murine models of gene therapy 
15 directed to liver and lung verified that the efficiency 

of rAAV transduction is low due limited conversion of the 
input ss genome to a transcriptionally active ds 
intermediate, and that this conversion is facilitated by 
expression of adenovirus El and E4 gene products. 

20 Example 19 - Second Generation rAAV with Regulated 
Minicene Capable of Enhancing Tra nsduction 

The experiments described in previous examples 
illustrated the following principles: 1} purified rAAV is 
a relatively inefficient gene transfer vehicle in vitro 
25 and in vivo and 2) the rate limiting step in transduction 
is not viral entry but rather conversion of the virion's 
ss DNA genome to a transcriptionally active ds DNA 
genome. Adenovirus can substantially enhance 
transduction through expression of a subset of its genes. 
30 It does this by promoting conversion of the virion's 

genome to its ds form. One approach to accomplish this 
is to incorporate into the recombinant AAV genome a 
minigene that expresses the minimal adenoviral genes 
necessary to enhance transduction, i.e., the ORF6 region 
of E4 . 
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Two approaches have been considered in designing 
this modified rAAV. The first strategy is based on a 
rAAV genome that has two transcriptional units in series, 
one expressing the therapeutic gene and the other 
5 expressing its E4 ORF6 from a constitutive promoter. 

While this may, in fact, be useful in many situations, 

constitutive - expression— of— ORF6— may— be-detr-imental— -to-the- 

cell and potentially could elicit a destructive immune 
response . 

10 The second version of this rAAV includes the 

therapeutic minigene in addition to the ORF6 
transcriptional unit which, in this case, is expressed 
from an inducible promoter. When this second gene rAAV 
is administered to the cells (ex vivo strategies) or to 
15 the patient (in vivo strategies) , the inducing agents are 
administered at the time of gene transfer or soon 
thereafter. If the ds genomic form or its integrated 
derivative is stable, the induction of ORF6 will only be 
necessary at the time of gene transfer into the recipient 
20 cell. Following this, its inducing agent will be 
withdrawn and the ORF6 gene will be turned off. 

An rAAV that illustrates this concept of inducible 
ORF6 has been constructed and tested in vitro . A 
schematic of the vector pAV . CMVALP . GRE-ORF6 , is shown in 
25 Fig. 11 and its sequence is illustrated in SEQ ID NO: 5. 

This second generation construct contains flanking 5' and 
3' AAV ITR sequences. The human placental alkaline 
phosphatase cDNA (ALP) is included in a minigene in which 
the promoter from the immediate early gene of 
30 cytomegalovirus drives the transcription. A second 

transcriptional unit is cloned between the ITRs in series 
and in direct orientation with the alkaline phosphatase 
minigene. The second transcriptional unit expresses the 
Ad5— E4— ORF6 from a glucocorticoid dependent promoter 
35 (GRE) with an SV40 polyadenylation signal. This is 
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called a second generation rAAV construct. 

Specifically, pAV.CMVALP.GRE-ORF6 [SEQ ID NO: 5] 
generates a novel rAAV containing the LacZ transgene and 
the Ad E4 ORF 6 which facilitates ss to ds conversion of 
5 rAAV. The plasmid includes a flanking AAV 5* ITR 

sequence (nucs. 53*219) ; CMV enhancer /promoter (nucs. 

— 255-848 )-;—human-placenta— alkaline -phosphatase-~cDNA--(ALP}~ 

(nucs. 914-2892); SV40 polyA (nucs. 2893-3090); GRE 
promoter (nucs. 3114-3393); Ad5 E4-ORF6 cDNA (nucs. 3402- 
10 4286); SV40 polyA (nucs. 4315-4512); and 3* AAV ITR 

(nucs. 4547-4713) . All other nucleotides are plasmid 
derived . 

The second generation rAAV construct was used to 
produce and purify rAAV virions which were exposed to 
15 HeLa cells that were left untreated or incubated with 
dexamethasone. In the absence of dexamethasone, (a 
condition under which little 0RF6 should be expressed) , 
little transduction was observed as measured by 
expression of the alkaline phosphatase gene. Cells 
20 incubated in dexamethasone expressed in ORF 6 gene and the 
transduction efficacy was enhanced at least 5-fold. This 
provides evidence to support that a gene product 
expressed from the rAAV can function in cis to enhance 
expression of the transgene. 

25 Example 20 - Application to Bone Marrow Directed Gene 

Bone marrow directed gene therapy represents the 
paradigm of ex vivo gene therapy where the target cell is 
the hematopoietic stem cell. The basic strategy is to 
30 incorporate (i.e«, integrate) a therapeutic minigene into 
the chromosomal DNA of hematopoietic stem cells which are 
transplanted into a recipient patient whose own bone 
marrow has been ablated allowing repopulation of its 
lymphohematopoietic system with progeny of the 
genetically corrected stem cell. 
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The problem with this approach has been efficiently 
transfecting genes into stem cells. Most studies of bone 
marrow directed gene therapy have utilized recombinant 
retroviruses which have not been very efficient. One 
5 problem is that retroviruses integrate their provirus 
only when the target cell is dividing. Unfortunately, 

^most— stem-eel-ls—in— vitro-are-quiescent-and-not-dividing.— 

rAAV holds the promise of integrating the provirus more 
efficiently into non-dividing stem cells. However, 

10 purified rAAV is not very efficient with respect to 
integration when used alone. In cultured cells, 
integration is observed in less than 1% of the cells. 

The same conditions that activate the conversion of ss to 
ds genome also enhance the integration of the ds 
15 intermediate into the chromosomal DNA. 

Therefore, a desirable application of the methods 
and compositions of this invention is in bone marrow 
directed gene therapy. According to this method, stem 
cells are genetically modified with rAAV and an inducing 
20 agent ex vivo using the constructs and methods described 
above (see e.g.. Example 19). Genetically modified stem 
cells are subsequently transplanted by conventional 
techniques . 

Numerous modifications and variations of the present 
25 invention are included in the above-identified 

specification and are expected to be obvious to one of 
skill in the art. Such modifications and alterations to 
the compositions and processes of the present invention, 
such as selections of different transgenes and plasmids 
30 for the construction of the packaging cell lines and 
rAds, or selection or dosage of the viruses or immune 
modulators, are believed to be encompassed in the scope 
of the claims appended hereto. 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1521.. 2405 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



CTGCATGTGT 


CAGAGGTTTT 


CACCGTCATC 


ACCGAAACGC 


GCGAGGCAGC 


50 


AAGCTTGGCA 


GAAATGGTTG 


AACTCCCGAG 


AGTGTCCTAC 


ACCTAGGGGA 


100 


GAAGCAGCCA 


AGGGGTTGTT 


TCCCACCAAG 


GACGACCCGT 


CTGCGCACAA 


150 


ACGGATGAGC 


CCAT C AGACA 


AAGACATATT 


CATTCTCTGC 


TGCAAACTTG 


200 


GCATAGCTCT 


GCTTTGCCTG 


GGGCTATTGG 


GGGAAGTTGC 


GGTTCGTGCT 


250 


CGCAGGGCTC 


TCACCCTTGA 


CTCTTTCAAT 


AATAACTCTT 


CTGTGCAAGA 


300 


TTACAATCTA 


AACAATTCGG 


AGAACTCGAC 


CTTCCTCCTG 


AGGCAAGGAC 


350 


C AC AG C CAAC 


TTCCTCTTAC 


AAGCCGCATC 


GATTTTGTCC 


TTCAGAAATA 


400 


GAAATAAGAA 


TGCTTGCTAA 


AAATTATATT 


TTTACCAATA 


AGACCAATCC 


450 


AATAGGTAGA 


TTATTAGTTA 


CTATGTTAAG 


AAATGAATCA 


TTAT CTTTTA 


500 


GTACTATTTT 


TACTCAAATT 


CAGAAGTTAG 


AAATGGGAAT 


AGAAAATAGA 


550 


AAGAGACGCT 


CAACCTCAAT 


TGAAGAACAG 


GTG C AAGGAC 


TATTGACCAC 


600 


AGGCCTAGAA 


GTAAAAAAGG 


GAAAAAAGAG 


TGTTTTTGTC 


AAAATAGGAG 


650 


ACAGGTGGTG 


GCAACCAGGG 


ACTTATAGGG 


GACCTTACAT 


CTACAGACCA 


700 


ACAGATGCCC 


CCTTACCATA 


TACAGGAAGA 


TATGACTTAA 


ATTGGGATAG 


750 


GTGGGTTACA 


GTCAATGGCT 


ATAAAGTGTT 


ATATAGATCC 


CTCCCCTTTC 


800 


GTGAAAGACT 


CGCCAGAGCT 


AGACCTCCTT 


GGTGTATGTT 


GTCTCAAGAA 


850 


AAGAAAGACG 


ACATGAAACA 


ACAGGTACAT 


GATTATATTT 


ATCTAGGAAC 


900 
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AGGAATGCAC 


TTTTGGGGAA 


AGATTTTCCA 


TACCAAGGAG 


GGGACAGTGG 


950 


CTGGACTAAT 


AGAACATTAT 


TCTGCAAAAA 


CTTATGGCAT 


GAGTTATTAT 


1000 


GATTAGCCTT 


GATTTGCCCA 


AC CTTGCGGT 


TCCCAAGGCT 


TAAGTAAGTT 


1050 


TTTGGTTACA 


AACTGTTCTT 


AAAACAAGGA 


TGTGAGACAA 


GTGGTTTCCT 


1100 


GACTTGGTTT 


GGTATCAAAG 


GTTCTGATCT 


GAGCTCTGAG 


TGTTCTATTT 


1150 


TCCTATGTTC 


TTTTGGAATT 


TAT CCAAATC 


TTATGTAAAT 


GCTTATGTAA 


1200 


ACCAAGATAT 


AAAAGAGTGC 


TGATTTTTTG 


AGTAAACTTG 


CAACAGTCCT 


1250 


AACATTCACC 


TCTTGTGTGT 


TTGTGTCTGT 


TCGCCATCCC 


GTCTCCGCTC 


1300 


GTCACTTATC 


CTTCACTTTC 


CAGAGGGTCC 


CCCCGCAGAC 


CCCGGCGACC 


1350 


CTCAGGTCGG 


CCGACTGCGG 


CAGCTGGCGC 


CCGAACAGGG 


ACCCTCGGAT 


1400 


AAGTGACCCT 


TGTCTTTATT 


TCTACTATTT 


TGTGTTCGT C 


TTGTTTTGTC 


1450 


TCTATCTTGT 


CTGGCTATCA 


TCACAAGAGC 


GGAACGGACT 


CACCTCAGGG 


1500 


AACCAAGCTA 


GCCCAATTCG 


ATGACTACGT 


CCGGCGTTCC 


ATTTGGCATG 


1550 


ACACTACGAC 


CAACACGATC 


TCGGTTGTCT 


CGGCGCACTC 


CGTACAGTAG 


1600 


GGATCGT CTA 


CCTCCTTTTG 


AGACAGAAAC 


CCGCGCTACC 


ATACTGGAGG 


1650 


ATCATCCGCT 


GCTGCCCGAA 


TGTAACACTT 


TGACAATGCA 


CAACGTGAGT 


1700 


TACGTGCGAG 


GTCTTCCCTG 


CAGTGTGGGA 


TTTACGCTGA 


TTCAGGAATG 


1750 


GGTTGTTCCC 


TGGGATATGG 


TTCTAACGCG 


GGAGGAGCTT 


GTAATCCTGA 


1800 


GGAAGTGTAT 


GCACGTGTGC 


CTGTGTTGTG 


CCAACATTGA 


TATCATGACG 


1850 


AGCATGATGA 


TCCATGGTTA 


CGAGTCCTGG 


GCTCTCCACT 


GTCATTGTTC 


1900 


CAGTCCCGGT 


TCCCTGCAGT 


GTATAGCCGG 


CGGGCAGGTT 


TTGGCCAGCT 


1950 


GGTTTAGGAT 


GGTGGTGGAT 


GGCGCCATGT 


TTAATCAGAG 


GTTTATATGG 


2000 


TACCGGGAGG 


TGGTGAATTA 


CAACATGCCA 


AAAGAGGTAA 


TGTTTATGTC 


2050 


CAGCGTGTTT 


ATGAGGGGT C 


GCCACTTAAT 


CTACCTGCGC 


TTGTGGTATG 


2100 


ATGGCCACGT 


GGGTTCTGTG 


GTCCCCGCCA 


TGAGCTTTGG 


ATACAGCGCC 


2150 


TTGCACTGTG 


GGATTTTGAA 


CAATATTGTG 


GTGCTGTGCT 


GCAGTTACTG 


2200 
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TGCTGATTTA 


AGTGAGATCA 


GGGTGCGCTG 


CTGTGCCCGG 


AGGACAAGGC 


2250 


GCCTTATGCT 


GCGGGCGGTG 


CGAATCATCG 


CTGAGGAGAC 


CACTGCCATG 


2300 


TTGTATTCCT 


GCAGGACGGA 


GCGGCGGCGG 


CAGCAGTTTA 


TTCGCGCGCT 


2350 


GCTGCAGCAC 


CACCGCCCTA 


TCCTGATGCA 


CGATTATGAC 


TCTACCCCCA 


2400 


TGTAGGGATC 


CAAGCTTGCG 


GGCGCATCGA 


TGATATCAAG 


CTTGCATGCC 


2450 


TGCAGGTCGA 


CTCTAGAGGA 


TCCCGGGTGG 


NATCCCTGTG 


ACCCCTCCCC 


2500 


AGTGCCTCTC 


CTGGCCCTGG 


AAGTTGG C AC 


TCCAGTGCCC 


ACCAGCCTTG 


2550 


TCCTAATAAA 


ATTAAGTTGN 


ATCATTTTGT 


CTGACTAGGT 


GTCCTTCTAT 


2600 


AATATTATGG 


GGTGGAGGGG 


GGTGGTATGG 


AGCAANGGGN 


AANTTGGNAA 


2650 


GACAANCTGT 


AGGGCCTGCG 


GGGTCTATTG 


GGAACAAGCT 


GGAGTGCAGT 


2700 


GGCACAATCT 


TGGCTCACTG 


CAATCTCCGC 


CTCCTGGGTT 


CAAGCGATTC 


2750 


TCCTGCCTCA 


GACTCCCGAG 


TTGTTGGGAT 


TCCAGGCATG 


CATGACCAGG 


2800 


CTCAGATAAT 


TTTTGTTTTT 


TTGGTAGAGA 


CGGGGTTTCA 


CCATATTGGN 


2850 


CAGGCTGGTC 


TCCAACTCCT 


AAT CTCAGGT 


GATCTNCCCA 


CCTTGGCCTC 


2900 


CCAAATTGCT 


GGGATTACAG 


GNGTGAACCA 


CTGNTCCCTT 


CCCTGTCCTT 


2950 


CTGATTTTAA 


AATAACTATA 


CCAGCAGGAG 


GACGTCCAGA 


CACAGCATAG 


3000 


GCTACCTGGC 


CATGCCCAAC 


CGGTGGGACA 


TTTGAGTTGC 


TTGCTTGGCA 


3050 


CTGTCCTCTC 


ATGCGTTGGG 


TCCACTCAGT 


AGATGCCTGT 


TGAATTGGGT 


3100 


ACGCGGCCAG 


CTTGGCTGTG 


GAATGTGTGT 


CAGTTAGGGT 


GTGGAAAGTC 


3150 


CCCAGGCTCC 


CCAGCAGGCA 


GAAGTATGCA 


AAGCATGCAT 


CT CAATTAGT 


3200 


CAGCAACCAG 


GTGTGGAAAG 


TCCCCAGGCT 


CCCCAGCAGG 


CAGAAGTATG 


3250 


CAAAGCATGC 


ATCTCAATTA 


GTCAGNAACC 


ATAGNCCCGC 


CCCTAACTCC 


3300 


GTCCATCCCG 


GCCCTAACTC 


NGGCCAGTTC 


CGACCNTNCT 


C CGGCNNATG 


3350 


GNTGAGTAAT 


TTGCNNGATT 


TATGCAGNGG 


GCGAGGNCGC 


CTCGGGCTCT 


3400 


GAGNTNTTCC 


AGAAGTAGTG 


AGGAGGCTTT 


NNTGGTGGAA 


TTG AT C AG CT 


3450 


TGGGATCTGA 


TCAAGAGACA 


GGATGAGGAT 


CGNNN CGNAT 


GATTGAACAA 


3500 
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GATGGGTTGC 


ACGGAGGTT C 


TCCGGNCGCT 


TGGGTGGGGA 


GGNTATT CGG 


3550 


NTATTNTTGG 


TGNACAACAG 


NNAAACGGNT 


GTTCTGATGC 


CGCCGCGTTC 


3600 


NCGCTTTCAG 


NGCAGGGGGG 


CCCCCCTTCT 


NTTGAGANNA 


GCNCCCCTTN 


3650 


TTG 










3653 



( 2 ) INFORMATION FOR SEQ IDNO:2.: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met 

1 


Thr 


Thr 


Ser 


Gly Val 
5 


Pro 


Phe 


Gly 


Met 

10 


Thr 


Leu 


Arg 


Pro 


Thr 

15 


Arg 


Ser 


Arg 


Leu 


Ser 

20 


Arg 


Arg 


Thr 


Pro 


Tyr 

25 


Ser 


Arg 


Asp 


Arg 


Leu 

30 


Pro 


Pro 


Phe 


Glu 


Thr 

35 


Glu 


Thr 


Arg 


Ala 


Thr 

40 


lie 


Leu 


Glu 


Asp 


His 

45 


Pro 


Leu 


Leu 


Pro 


Glu 

50 


Cys 


Asn 


Thr 


Leu 


Thr 

55 


Met 


His 


Asn 


Val 


Ser 

60 


Tyr 


Val 


Arg 


Gly 


Leu 

65 


Pro 


Cys 


Ser 


Val 


Gly 

70 


Phe 


Thr 


Leu 


lie 


Gin 

75 


Glu 


Trp 


Val 


Val 


Pro 

80 


Trp 


Asp 


Met 


Val 


Leu 

85 


Thr 


Arg 


Glu 


Glu 


Leu 

90 


val 


lie 


Leu 


Arg 


Lys 

95 


Cys 


Met 


His 


Val 


Cys 

100 


Leu 


Cys 


Cys 


Ala 


Asn 

105 


lie 


Asp 


lie 


Met 


Thr 

110 


Ser 


Met 


Met 


lie 


Tyr 

115 


Gly 


Tyr 


Glu 


Ser 


Trp 

120 


Ala 


Leu 


His 


Cys 


His 

125 


Cys 


Ser 


Ser 


Pro 


Gly 

130 


Ser 


Leu 


Gin 


Cys 


He 

135 


Ala 


Gly 


Gly 


Gin 


Val 

140 


Leu 


Ala 


Ser 


Trp 


Phe Arg 
145 


Met 


Val 


Val 


Asp 

150 


Gly 


Ala 


Met 


Phe 


Asn 

155 


Gin 


Arg 


Phe 


lie 


Trp 

160 
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Tyr 


Arg 


Glu 


Val 


Val 

165 


Asn 


Tyr 


Asn 


Met 


Pro 
17 0 


Lys 


Glu 


Val 


Met 


Phe 

175 


Met 


Ser 


Ser 


Val 


Phe 

180 


Met Arg 


Gly 


Arg 


His 

185 


Leu 


lie 


Tyr 


Leu 


Arg 

190 


Tyr 


Trp 


Tyr 


Asp 


Gly His 
195 


Val 


Gly 


Ser 


Val 

200 


Val 


Pro 


Ala 


Met 


Ser 

205 


Phe 


Gly 


Tyr 


Ser- 


-Ala- 


-Leu- 


-His- 


Cys 


-Gly- 


-I-le 

215 


-Leu- 


-Asn 


-Asn- 


-He 


Val- 

220 


Val- 


Leu 


-Cys- 


PVR 




210 


wjr d - 


Ser 

225 


Tyr 


Cys 


Ala 


Asp 


Leu 

230 


Ser 


Glu 


lie 


Arg 


Val 

235 


Arg 


Cys 


Cys 


Ala 


Arg 

240 


Arg 


Thr 


Arg 


Arg 


Leu 

245 


Met 


Leu 


Arg 


Ala 


Val 

250 


Arg 


lie 


lie 


Ala 


Glu 

255 


Glu 


Thr 


Thr 


Ala 


Met 

260 


Leu 


Tyr 


Ser 


Cys 


Arg 

265 


Thr 


Glu 


Arg Arg 


Arg 

270 


Gin 


Gin 


Phe 


lie Arg 
275 


Ala 


Leu 


Leu 


Gin 


His 

280 


His 


Arg 


Pro 


lie 


Leu 

285 


Met 


His 


Asp 


Tyr 


Asp 


Ser 


Thr 


Pro 


Met 























290 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35408 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: other nucleic acid 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 50 

GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 100 

TAGTAGTGTG GCGGAAGTGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 150 

GCGACGGATG TGGCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 200 

GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 250 

CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 300 
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AGTGAAATCT 


GAATAATTTT 


GTGTTACTCA 


TAGCGCGTAA 


TATTTGTCTA 


350 


GGGAG AT CAG 


CCTGCAGGTC 


GTTACATAAC 


TTACGGTAAA 


TGGCCCGCCT 


400 


GGCTGACCGC 


CCAACGACCC 


CCGCCCATTG 


ACGTCAATAA 


TGACGTATGT 


450 


TCCCATAGTA 


ACGCCAATAG 


GGACTTTCCA 


TTGACGTCAA 


TGGGTGGAGT 


500 


ATTTACGGTA 


AACTGCCCAC 


TTGGCAGTAC 


ATCAAGTGTA 


TCATATGCCA 


550 


AGTACGCCCC 


CTATTGACGT 


CAATGACGGT 


AAATGGCCCG 


CCTGGCATTA 


600 


TGCCCAGTAC 


ATG AC CTTAT 


GGGACTTTCC 


TACTTGGCAG 


TACATCTACT 


650 


CGAGGCCACG 


TTCTGCTTCA 


CTCTCCCCAT 


CTCCCCCCCC 


TCCCCACCCC 


700 


CAATTTTGTA 


TTTATTTATT 


TTTTAATTAT 


TTTGTGCAGC 


GATGGGGGCG 


750 


GGGGGGGGGG 


GGGGGCGCGC 


GCCAGGCGGG 


GCGGGGCGGG 


GCGAGGGGCG 


800 


GGG CGGGGCG 


AGGCGGAGAG 


GTGCGGCGGC 


AGCCAATCAG 


AGCGGCGCGC 


850 


TCCGAAAGTT 


TCCTTTTATG 


GCGAGGCGGC 


GGCGGCGGCG 


GCCCTATAAA 


900 


AAGCGAAGCG 


CGCGGC GGGC 


GGGAGCGGGA 


TCAGCCACCG 


CGGTGGCGGC 


950 


CGCAATTCCC 


GGGGATCGAA 


AGAGCCTGCT 


AAAGCAAAAA 


AGAAGTCACC 


1000 


ATGTCGTTTA 


CTTTGACCAA 


CAAGAACGTG 


ATTTTCGTTG 


CCGGTCTGGG 


1050 


AGGCATTGGT 


CTGGACACCA 


GCAAGGAGCT 


GCTCAAGCGC 


GATCCCGTCG 


1100 


TTTTACAACG 


TCGTGACTGG 


GAAAACCCTG 


G CGTTACCC A 


ACTTAATCGC 


1150 


CTTGCAGCAC 


ATCCCCCTTT 


CGCCAGCTGG 


CGTAATAGCG 


AAGAGGCCCG 


1200 


CACCGATCGC 


CCTTCCCAAC 


AGTTGCGCAG 


CCTGAATGGC 


GAATGGCGCT 


1250 


TTGCCTGGTT 


TCCGGCACCA 


GAAGCGGTG C 


CGGAAAGCTG 


GCTGGAGTGC 


1300 


GATCTTCCTG 


AGGCCGATAC 


TGTCGTCGTC 


CCCTCAAACT 


GGCAGATGCA 


1350 


CGGTTACGAT 


GCGCCCATCT 


ACACCAACGT 


AACCTATCCC 


ATTACGGTCA 


1400 


ATCCGCCGTT 


TGTTCCCACG 


GAGAATCCGA 


CGGGTTGTTA 


CTCGCTCACA 


1450 


TTTAATGTTG 


ATGAAAGCTG 


GCTACAGGAA 


GGC CAGACGC 


GAATTATTTT 


1500 


TGATGGCGTT 


AACTCGGCGT 


TTCATCTGTG 


GTGCAACGGG 


CGCTGGGTCG 


1550 


GTTACGGCCA 


GGACAGTCGT 


TTGCCGTCTG 


AATTTGACCT 


GAGCGCATTT 


1600 
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TTA CGCGCCG 


GAGAAAACCG 


CCTCGCGGTG 


ATGGTGCTGC 


GTTGGAGTGA 


1650 


CGGCAGTTAT 


CTGGAAGATC 


AGGATATGTG 


GCGGATGAGC 


GGCATTTTCC 


1700 


GTGACGTCTC 


GTTGCTGCAT 


AAACCGACTA 


CACAAATCAG 


CGATTTCCAT 


1750 


GTTGCCACTC 


GCTTTAATGA 


TGATTTCAG C 


CGCGCTGTAC 


TGGAGGCTGA 


1800 


AGTTCAGATG 


TGCGGCGAGT 


TGCGTGACTA 


CCTACGGGTA 


ACAGTTTCTT 


1850 


TATGGCAGGG 


TGAAACGCAG 


GTCGCCAGCG 


GCACCGCGCC 


TTTCGGCGGT 


1900 


GAAATTATCG 


ATGAGCGTGG 


TGGTTATGCC 


GATCGCGTCA 


CACTACGTCT 


1950 


GAACGTCGAA 


AACCCGAAAC 


TGTGGAGCGC 


CGAAATCCCG 


AATCTCTATC 


2000 


GTGCGGTGGT 


TGAACTGCAC 


ACCGCCGACG 


GCACGCTGAT 


TGAAGCAGAA 


2050 


GCCTGCGATG 


TCGGTTTCCG 


CGAGGTGCGG 


ATTGAAAATG 


GTCTGCTGCT 


2100 


GCTGAACGG C 


AAGCCGTTGC 


TGATTCGAGG 


CGTTAACCGT 


CACGAGCATC 


2150 


ATCCTCTGCA 


TGGTCAGGTC 


ATGGATGAGC 


AGACGATGGT 


GCAGGATATC 


2200 


CTGCTGATGA 


AGCAGAACAA 


CTTTAACGCC 


GTGCGCTGTT 


CGCATTATCC 


2250 


GAACCATCCG 


CTGTGGTACA 


CGCTGTGCGA 


CCGCTACGGC 


CTGTATGTGG 


2300 


TGG ATGAAG C 


CAATATTGAA 


ACCCACGGCA 


TGGTGCCAAT 


GAATCGTCTG 


2350 


ACCGATGATC 


CGCGCTGGCT 


ACCGGCGATG 


AGCGAACGCG 


TAACGCGAAT 


2400 


GGTGCAGCGC 


GATCGTAATC 


ACCCGAGTGT 


GATCATCTGG 


TCGCTGGGGA 


2450 


ATGAATCAGG 


CCACGGCG CT 


AATCACGACG 


CGCTGTATCG 


CTGGATCAAA 


2500 


TCTGTCGATC 


CTTCCCGCCC 


GGTGCAGTAT 


GAAGGCGGCG 


GAGCCGACAC 


2550 


CACGGCCACC 


GATATTATTT 


GCCCGATGTA 


CGCGCGCGTG 


GATGAAGACC 


2600 


AGCCCTTCCC 


GGCTGTGCCG 


AAATGGTCCA 


TCAAAAAATG 


GCTTTCGCTA 


2650 


CCTGGAGAGA 


CGCGCCCGCT 


GATCCTTTGC 


GAATACGCCC 


ACGCGATGGG 


2700 


TAACAGTCTT 


GGCGGTTTCG 


CTAAATACTG 


GCAGGCGTTT 


CGTCAGTATC 


2750 


CCCG TTTACA 


GGGCGGCTTC 


GTCTGGGACT 


GGGTGGATCA 


GTCGCTGATT 


2800 


AAATATGATG 


AAAACGGCAA 


CCCGTGGTCG 


GCTTACGGCG 


GTGATTTTGG 


2850 


CGATACGCCG 


AACGATCGCC 


AGTTCTGTAT 


GAACGGTCTG 


GTCTTTGCCG 


2900 
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ACCGGACGCC 


GCATCCAGCG 


CTGACGGAAG 


CAAAACACCA 


GCAGCAGTTT 


2950 


TTCCAGTTCC 


GTTTATCCGG 


GCAAACCATC 


GAAGTGACCA 


GCGAATACCT 


3000 


GTTCCGTCAT 


AGCGATAACG 


AGCTCCTGCA 


CTGGATGGTG 


GCGCTGGATG 


3050 


GTAAGCCGCT 


GGCAAGCGGT 


GAAGTGCCTC 


TGGATGTCGC 


TCCACAAGGT 


3100 


AAACAGTTGA 


TTGAACTG CC 


TGAACTACCG 


CAGCCGGAGA 


GCGCCGGGCA 


3150 


ACTCTGGCTC 


ACAGTACGCG 


TAGTGCAACC 


GAACGCGACC 


GCATGGTCAG 


3200 


AAGCCGGGCA 


CATCAGCGCC 


TGGCAGCAGT 


GGCGTCTGGC 


GGAAAACCTC 


3250 


AGTGTGACGC 


TCCCCGCCGC 


GTCCCACGCC 


ATCCCGCATC 


TGACCACCAG 


3300 


CGAAATGGAT 


TTTTGCATCG 


AGCTGGGTAA 


TAAGCGTTGG 


CAATTTAACC 


3350 


GCCAGTCAGG 


CTTTCTTTCA 


CAGATGTGGA 


TTGGCGATAA 


AAAACAACTG 


3400 


CTGACGCCGC 


TGCGCGATCA 


GTTCACCCGT 


GCACCGCTGG 


ATAACGACAT 


3450 


TGGCGTAAGT 


GAAGCGACCC 


GCATTGACCC 


TAACGCCTGG 


GTCGAACGCT 


3500 


GGAAGGCGGC 


GGGCCATTAC 


CAGGCCGAAG 


CAGCGTTGTT 


GCAGTGCACG 


3550 


GCAGATACAC 


TTGCTGATGC 


GGTG CTG ATT 


ACGACCGCTC 


ACGCGTGGCA 


3600 


GCATCAGGGG 


AAAACCTTAT 


TTATCAGCCG 


GAAAACCTAC 


CGGATTGATG 


3650 


GTAGTGGTCA 


AATGGCGATT 


ACCGTTGATG 


TTGAAGTGGC 


GAGCGATACA 


3700 


CCGCATCCGG 


CGCGGATTGG 


CCTGAACTGC 


CAGCTGGCGC 


AGGTAGCAGA 


3750 


GCGGGTAAAC 


TGGCTCGGAT 


TAGGGCCGCA 


AGAAAACTAT 


CCCGACCGCC 


3800 


TTACTGCCGC 


CTGTTTTGAC 


CGCTGGGATC 


TGCCATTGTC 


AGACATGTAT 


3850 


ACCCCGTACG 


TCTTCCCGAG 


CGAAAACGGT 


CTGCGCTGCG 


GGACGCGCGA 


3900 


ATTGAATTAT 


GGCCCACACC 


AGTGGCGCGG 


CGACTTCCAG 


TTCAACATCA 


3950 


GCCGCTACAG 


TCAACAGCAA 


CTGATGGAAA 


CCAGCCATCG 


CCATCTGCTG 


4000 


CACGCGGAAG 


AAGGCACATG 


GCTGAATATC 


GACGGTTTCC 


ATATGGGGAT 


4050 


TGGTGGCGAC 


GACTCCTGGA 


GCCCGTCAGT 


ATCGGCGGAA 


TTACAGCTGA 


4100 


GCGCCGGTCG 


CTACCATTAC 


CAGTTGGTCT 


GGTGTCAAAA 


ATAATAATAA 


4150 


CCGGGCAGGC 


CATGTCTGCC 


CGTATTTCGC 


GTAAGGAAAT 


CCATTATGTA 


4200 
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CTATTTAAAA 


AACACAAACT 


TTTGGATGTT 


CGGTTTATTC 


TTTTTCTTTT 


4250 


ACTTTTTTAT 


CATGGGAGCC 


TACTTCCCGT 


TTTTCCCGAT 


TTGGCTACAT 


4300 


GACATCAACC 


ATATCAGCAA 


AAGTGATACG 


GGTATTATTT 


ttgccgctat 


4350 


TTCTCTGTTC 


TCGCTATTAT 


TCCAACCGCT 


GTTTGGTCTG 


CTTTCTGACA 


4400 


AACTCGGCCT 


CGACTCTAGG 


CGGCCGCGGG 


GATCCAGACA 


TGATAAGATA 


4450 


CATTGATGAG 


TTTGGACAAA 


CCACAACTAG 


AATGCAGTGA 


AAAAAATGCT 


” 4500 


TTATTTGTGA 


AATTTGTGAT 


GCTATTGCTT 


TATTTGTAAC 


CATTATAAGC 


4550 


TGCAATAAAC 


AAGTTAACAA 


CAACAATTGC 


ATTCATTTTA 


TGTTTCAGGT 


4600 


TCAGGGGGAG 


GTGTGGGAGG 


TTTTTTCGGA 


TCCTCTAGAG 


TCGACCTGCA 


4650 


GGCTGATCAG 


TGGAAGGTGC 


TGAGGTACGA 


TGAGACCCGC 


ACCAGGTGCA 


4700 


GACCCTGCGA 


GTGTGGCGGT 


AAACATATTA 


GGAACCAGCC 


TGTGATGCTG 


4750 


GATGTGACCG 


AGGAGCTGAG 


GCCCGATCAC 


TTGGTGCTGG 


CCTGCACCCG 


4800 


CGCTGAGTTT 


GGCTCTAGCG 


ATGAAGATAC 


AGATTGAGGT 


ACTGAAATGT 


4850 


GTGGGCGTGG 


CTTAAGGGTG 


GGAAAGAATA 


TATAAGGTGG 


GGGTCTTATG 


4900 


TAGTTTTGTA 


TCTGTTTTGC 


AGCAGCCGCC 


GCCGCGATGA 


GCACCAACTC 


4950 


GTTTGATGGA 


AGCATTGTGA 


GCT CATATTT 


GACAACGCGC 


ATGCCCCCAT 


5000 


GGGCCGGGGT 


GCGTCAGAAT 


GTGATGGG CT 


CCAGCATTGA 


TGGTCGCCCC 


5050 


GTCCTGC CCG 


CAAACTCTAC 


TACCTTGACC 


TACGAGACCG 


TGTCTGGAAC 


5100 


GCCGTTGGAG 


ACTGCAGCCT 


CCGCCGCCGC 


TTCAGCCGCT 


GCAGCCACCG 


5150 


CCCGCGGGAT 


TGTGACTGAC 


TTTGCTTTCC 


TGAGCCCGCT 


TGCAAGCAGT 


5200 


GCAGCTTCCC 


GTTCATCCGC 


CCGCGATGAC 


AAGTTGACGG 


CTCTTTTGGC 


5250 


ACAATTGGAT 


TCTTTGACCC 


GGGAACTTAA 


TGTCGTTTCT 


CAGCAGCTGT 


5300 


TGGATCTGCG 


CCAGCAGGTT 


TCTGCCCTGA 


AGGCTTCCTC 


CCCTCCCAAT 


5350 


GCGGTTTAAA 


ACATAAATAA 


AAAACCAGAC 


TCTGTTTGGA 


TTTGGATCAA 


5400 


GCAAGTGTCT 


TGCTGTCTTT 


ATTTAGGGGT 


TTTGCGCGCG 


CGGTAGGCCC 


5450 


GGGACCAGCG 


GTCTCGGTCG 


TTGAGGGTCC 


TGTGTATTTT 


TTCCAGGACG 


5500 




WO 96/39530 



PCT/US96/1 0245 



78 



TGGTAAAGGT 


GACTCTGGAT 


GTTCAGATAC 


ATGGGCATAA 


GCCCGTCTCT 


5550 


GGGGTGGAGG 


TAGCACCACT 


GCAGAGCTTC 


ATGCTGCGGG 


GTGGTGTTGT 


5600 


AGATGATCCA 


GTCGTAGCAG 


GAGCGCTGGG 


CGTGGTGCCT 


AAAAATGTCT 


5650 


TTCAGTAGCA 


AGCTGATTGC 


CAGGGGCAGG 


CCCTTGGTGT 


AAGTGTTTAC 


5700 


AAAGCGGTTA 


AGCTGGGATG 


GGTGCATACG 


TGGGGATATG 


AGATGCATCT 


5750 


TGGACTGTAT 


TTTTAGGTTG 


GCTATGTTCC 


CAGCGATATC 


CCTCCGGGGA 


5800 


TTCATGTTGT 


GCAGAACCAC 


CAGCACAGTG 


TATC CGGTGC 


ACTTGGGAAA 


5850 


TTTGTCATGT 


AGCTTAGAAG 


GAAATGCGTG 


GAAGAACTTG 


GAGACGCCCT 


5900 


TGTGACCTCC 


AAGATTTTCC 


ATGCATTCGT 


CCATAATGAT 


GGCAATGGGC 


5950 


CCAC GGGCGG 


CGGCCTGGGC 


GAAGATATTT 


CTGGGATCAC 


TAACGTCATA 


6000 


GTTGTGTTC C 


AGGATGAGAT 


CGTCATAGGC 


CATTTTTACA 


AAGCGCGGGC 


6050 


GGAGGGTGCC 


AGACTGCGGT 


ATAATGGTTC 


CATCCGGCCC 


AGGGGCGTAG 


6100 


TTACCCTCAC 


AGATTTGCAT 


TTCCCACGCT 


TTGAGTTCAG 


ATGGGGGGAT 


6150 


CATGTCTACC 


TGCGGGGCGA 


TGAAGAAAAC 


GGTTTCCGGG 


GTAGGGGAGA 


6200 


TCAGCTGGGA 


AGAAAGCAGG 


TTCCTGAGCA 


GCTGCGACTT 


ACCGCAGCCG 


6250 


GTGGGCCCGT 


AAATCACACC 


TATTACCGGG 


TGCAACTGGT 


AGTTAAGAGA 


6300 


GCTGCAGCTG 


CCGTCATCCC 


TGAGCAGGGG 


GGCCACTTCG 


TTAAGCATGT 


6350 


CCCTGACTCG 


CATGTTTTCC 


CTGACCAAAT 


CCGCCAGAAG 


GCGCTCGCCG 


6400 


CCCAGCGATA 


GCAGTTCTTG 


CAAGGAAGCA 


AAGTTTTTCA 


ACGGTTTGAG 


6450 


ACCGTCCGCC 


GTAGGCATGC 


TTTTGAGCGT 


TTGACCAAGC 


AGTTCCAGGC 


6500 


GGTCCCACAG 


CTCGGTCACC 


TGCTCTACGG 


CATCTCGATC 


CAG CATATCT 


6550 


CCT CGTTTCG 


CGGGTTGGGG 


CGGCTTTCGC 


TGTACGGCAG 


TAGTCGGTGC 


6600 


TCGTCCAGAC 


GGGCCAGGGT 


CATGTCTTTC 


CACGGG CGC A 


GGGTCCTCGT 


6650 


CAGCGTAGTC 


TGGGTCACGG 


TGAAGGGGTG 


CGCT CCGGGC 


TGCGCGCTGG 


6700 


CCAGGGTGCG 


CTTGAGGCTG 


GTCCTGfiTGG 


TGCTGAAGCG 


CTGCCGGTCT 


6750 


TCGCCCTGCG 


CGTCGGCCAG 


GTAGCATTTG 


ACCATGGTGT 


CATAGTCCAG 


6800 
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CCCCTCCGCG 


GCGTGGCCCT 


TGGCGCGCAG 


CTTGCCCTTG 


GAGGAGGCGC 


6850 


CGCACGA GGG 


GCAGTGCAGA 


CTTTTGAGGG 


CGTAGAGCTT , GGGCGCGAGA 


6900 


AATACCGATT 


CCGGGGAGTA 


GGCATCCGCG 


CCGCAGGCCC 


CGCAGACGGT 


6950 


CTCGCATTCC 


ACGAGCCAGG 


TGAGCTCTGG 


CCGTTCGGGG 


TCAAAAACCA 


7000 


GGTTTCCCCC 


ATGCTTTTTG 


ATGCGTTTCT 


TACCTCTGGT 


TTCCATGAGC 


7050 


CGGTGTCCAC 


GCTCGGTGAC 


GAAAAGGCTG 


TCCGTGTCCC 


CGTATACAGA 


7100 


CTTGAGAGGC 


CTGTCCTCGA 


G CGGTGTTCC 


GCGGTCCTCC 


TCGTATAGAA 


7150 


ACTCGGACCA 


CTCTGAGACA 


AAGGCTCGCG 


TCCAGGCCAG 


CACGAAGGAG 


7200 


GCTAAGTGGG 


AGGGGTAGCG 


GTCGTTGTCC 


ACTAGGGGGT 


CCACTCGCTC 


7250 


CAGGGTGTGA 


AGACACATGT 


CGCCCTCTTC 


GGCATCAAGG 


AAGGTGATTG 


7300 


GTTTGTAGGT 


GTAGGCCACG 


TGACCGGGTG 


TTCCTGAAGG 


GGGGCTATAA 


7350 


AAGGGGGTGG 


GGGCGCGTTC 


GTCCTCACTC 


tcttccgcat 


CGCTGTCTGC 


7400 


GAGGGCCAGC 


TGTTGGGGTG 


AGTACTCCCT 


CTGAAAAGCG 


GGCATGACTT 


7450 


CTGCGCTAAG 


ATTGTCAGTT 


TCCAAAAACG 


AGGAGGATTT 


GATATTCACC 


7500 


TGGCCCGCGG 


TGATGCCTTT 


GAGGGTGG CC 


GCATCCATCT 


GGTCAGAAAA 


7550 


GACAATCTTT 


TTGTTGTCAA 


GCTTGGTGGC 


AAACGACCCG 


TAGAGGGCGT 


7600 


TGGACAGCAA 


CTTGGCGATG 


GAGCGCAGGG 


TTTGGTTTTT 


GTCGCGATCG 


7650 


GCGCGCTCCT 


TGGCCGCGAT 


GTTTAGCTGC 


ACGTATTCGC 


GCGCAACGCA 


7700 


CCGCCATTCG 


GGAAAGACGG 


TGGTGCGCTC 


GTCGGGCACC 


AGGTGCACGC 


7750 


GCCAACCGCG 


GTTGTGCAGG 


GTGACAAGGT 


CAACGCTGGT 


GGCTACCTCT 


7800 


CCGCGTAGGC 


GCTCGTTGGT 


CCAGCAGAGG 


CGGCCGCCCT 


TGCGCGAGCA 


7850 


GAATGGCGGT 


AGGGGGTCTA 


GCTGCGTCTC 


GTCCGGGGGG 


TCTGCGTCCA 


7900 


CGGTAAAGAC 


CCCGGGCAGC 


AGGCGCGCGT 


CGAAGTAGTC 


TATCTTG CAT 


7950 


CCTTGCAAGT 


CTAGCGCCTG 


CTGCCATGCG 


CGGGCGGCAA 


GCGCGCGCTC 


8000 


GTATGGGTTG 


AGTGGGGGAC 


CCCATGGCAT 


GGGGTGGGTG 


AGCGCGGAGG 


8050 


CGTACATGCC 


GCAAATGTCG 


TAAACGTAGA 


GGGGCTCTCT 


GAGTATTCCA 


8100 
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AGATATGTAG GGTAGCATCT TCCACCGCGG ATGCTGGCGC GCACGTAATC 8150 
GTATAGTTCG TGCGAGGGAG CGAGGAGGTC GGGAC CGAGG TTG CTACGGG 8200 

CGGGCTGCTC TGCTCGGAAG ACTATCTGCC TGAAGATGGC ATGTGAGTTG 8250 
GATGATATGG TTGGACGCTG GAAGACGTTG AAGCTGGCGT CTGTGAGACC 8300 
TACCGCGTCA CGCACGAAGG AGGCGTAGGA GTCGCGCAGC TTGTTGACCA 8350 
GCTCGGCGGT GACCTGCACG TCTAGGGCGC AGTAGTCCAG GGTTTCCTTG 8400 
ATGATGTCAT ACTTATCCTG TCCCTTTTTT TTCCACAGCT CGCGGTTGAG 8450 
GACAAACTCT TCGCGGTCTT TCCAGTACTC TTGGATCGGA AACCCGTCGG 8500 
CCTCCGAACG GTAAGAGCCT AGCATGTAGA ACTGGTTGAC GGCCTGGTAG 8550 
GCGCAGCATC CCTTTTCTAC GGGTAGCGCG TATGCCTGCG CGGCCTTCCG 8600 
GAGCGAGGTG TGGGTGAGCG CAAAGGTGTC CCTGACGATG ACTTTGAGGT 8650 
ACTGGTATTT GAAGTCAGTG TCGTCGCATC CGCCCTGCTC CCAGAGCAAA 8700 
AAGTCCGTGC GCTTTTTGGA ACG CGGATTT GGCAGGGCGA AGGTGACATC 8750 
GTTGAAGAGT ATCTTTCCCG CGCGAGGCAT AAAGTTGCGT GTGATGCGGA 8800 
AGGGTC C CGG CACCTCGGAA CGGTTGTTAA TTACCTGGGC GGCGAGCACG 8850 
ATCTCGTCAA AGCCGTTGAT GTTGTGGCCC ACAATGTAAA GTTC CAAGAA 8900 

GCGCGGGATG CCCTTGATGG AAGGCAATTX TTTAAGTTCC TCGTAGGTGA 8950 

GCTCTTCAGG GGAGCTGAGC CCGTGCTCTG AAAGGGCCCA GTCTGCAAGA 9000 

TGAGGGTTGG AAGCGACGAA TGAGCTCCAC AGGTCACGGG CCATTAGCAT 9050 

TTGCAGGTGG TCGCGAAAGG TCCTAAACTG GCGACCTATG GCCATTTTTT 9100 

CTGGGGTGAT GCAGTAGAAG GTAAGCGGGT CTTGTTCCCA GCGGTCCCAT 9150 

CCAAGGTTCG CGG CTAGGT C TCGCGCGGCA GTCACTAGAG GCTCATCTCC 9200 

GCCGAACTTC ATGACCAGCA TGAAGGGCAC GAGCTGCTTC CCAAAGGCCC 9250 

CCATCCAAGT ATAGGTCTCT ACATCGTAGG TGACAAAGAG ACGCTCGGTG 9300 

CGAGGATGCG AGCCGATCGG GAAGAACTGG ATCTCCCGCC ACCAATTGGA 9350 

GGAGTGGCTA TTGATGTGGT GAAAGTAGAA GTCCCTGCGA CGGGCCGAAC 9400 
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ACTCGTGCTG 
GGCTGTACAT 
GAGTGGGAAT 
CTTCGGCTGC 
GATCGGACCA 
CGGTCGGAGC 
GGAGCTCCCG 
CATAGA CGGG 
GGGCTGGTTG 
GCGCGACTAC 
GATGATGCAT 
GGCTCCGGAC 
GGGCAGGAGC 
GGCGGTTGAT 
GTGAGCTTGA 
GACGGCGGCC 
AGGCGATCTC 
CGTCCGGCTC 
GAGCTGCGAG 
CCACGCCCCC 
AGCTCCACGT 
GTAGTTGAGG 
AGCGTCGCAA 
ATGGCCTCGT 
CGACACGGTT 
CGCGCACCTC 



GCTTTTGTAA 
CCTGCACGAG 
TTGAGCCCCT 
TTGTCCTTGA 
CCACGCC GCG 
TTGATGACAA 
CGGCGTCAGG 
TCAGGGCG CG 
GTGGCGGCGT 
GGTACCGCGC 
CTAAAAGCGG 
CCGCCGGGAG 
TGGTGCTGCG 
CTCCTGAATC 
GCCTGAAAGA 
TGGCGCAAAA 
GGCCATGAAC 
GCTCCACGGT 
AAGGCGTTGA 
TTCGGCATCG 
GCCGGGCGAA 
GTGGTGGCGG 
CGTGGATTCG 
AGAAGTCCAC 
AACTCCTCCT 
GCGCTCAAAG 
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AAACGTGCGC 
GTTGACCTGA 
CGCCTGGCGG 
CCGTCTGGCT 
CGAGCCCAAA 
CATCGCGCAG 
TCAGGCGGGA 
GGCTAGATCC 
CGATGGCTTG 
GGCGGGCGGT 
TGA CGCGGGC 
AGGGGGCAGG 
CGCGTAGGTT 
TGGCGCCTCT 
GAGTTCGACA 
TCTCCTGCAC 
TGCTCGATCT 
GGCGGCGAGG 
GGCCTCCCTC 
CGGGCGCGC A 
GACGGCGTAG 
TGTGTTCTGC 
TTGATATCCC 
GGCGAAGTTG 
CCAGAAGACG 
GCTACAGGGG 



AGTACTGGCA 

CGACCGCGCA 

GTTTGGCTGG 

GCTCGAGGGG 

GTCCAGATGT 

ATGGGAGCTG 

GCTCCTGCAG 

AGGTGATACC 

CAAGAGGCCG 

GGGCCGCGGG 

GAGCCCCCGG 

GGCACGTCGG 

GCTGGCGAAC 

GCGTGAAGAC 

GAATCAATTT 

GTCTCCTGAG 

CTTCCTCCTG 

TCGTTGGAAA 

GTTCCAGACG 

TGACCACCTG 

TTTCGCAGGC 

CACGAAGAAG 

CCAAGGCCTC 

AAAAACTGGG 

GATGAGCTCG 

CCTCTTCTTC 



GCGGTGCACG 

CAAGGAAGCA 

TGGTCTTCTA 

AGTTACGGTG 

CCGCGCGCGG 

TCCATGGTCT 

GTTTACCTCG 

TAATTTCCAG 

CATCCCCGCG 

GGTGTCCTTG 

AGGTAGGGGG 

CGCCGCGCGC 

GCGACGACGC 

GACGGGCCCG 

CGGTGTCGTT 

TTGTCTTGAT 

GAGATCTCCG 

TGCGGGCCAT 

CGGCTGTAGA 

CGCGAGATTG 

GCTGAAAGAG 

TACATAACCC 

AAGGCGCTCC 

AGTTGCGCGC 

GCGACAGTGT 

TTCTTCAATC 



9450 

9500 

9550 

9600 

9650 

9700 

9750 

9800 

9850 

9900 

9950 

10000 

10050 

10100 

10150 

10200 

10250 

10300 

10350 

10400 

10450 

10500 

10550 

10600 

10650 



10700 
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TCCTCTTCCA 


TAAGGGCCTC 


CCCTTCTTCT 


TCTTCTGGCG 


GCGGTGGGGG 


10750 


AGGGGGGACA 


CGGCGGCGAC 


GACGGCGCAC 


CGGGAGGCGG 


TCGACAAAGC 


10800 


GCTCGATCAT 


CTCCCCGCGG 


CGACGGCGCA 


TGGTCTCGGT 


GACGGCGCGG 


10850 


CCGTTCTCGC 


GGGGGCGCAG 


TTGGAAGACG 


CCGCCCGTCA 


TGTCCCGGTT 


10900 


ATGGGTTGGC 


GGGGGGCTGC 


CATGCGGCAG 


GGATACGGCG 


CTAACGATGC 


10950 


ATCTCAACAA 


TTGTTGTGTA 


GGTACTCCGC 


CGCCGAGGGA 


CCTGAGCGAG 


“TlOOO 


TCCGCATCGA 


CCGGATCGGA 


AAACCTCTCG 


AGAAAGGCGT 


CTAACCAGTC 


11050 


ACAGTCGCAA 


GGTAGGCTGA 


GCACCGTGGC 


GGGCGGCAGC 


GGGCGGCGGT 


11100 


CGGGGTTGTT 


TCTGGCGGAG 


GTGCTGCTGA 


TGATGTAATT 


AAAGTAGGCG 


11150 


GTCTTGAGAC 


GGCGGATGGT 


CGACAGAAGC 


ACCATGTCCT 


TGGGTCCGGC 


11200 


CTG CTGAATG 


CGCAGGCGGT 


CGGCCATGCC 


CCAGGCTTCG 


TTTTGACATC 


11250 


GGCGCAGGTC 


TTTGTAGTAG 


TCTTGCATGA 


GCCTTTCTAC 


CGGCACTTCT 


11300 


TCTTCTCCTT 


CCTCTTGTCC 


TGCATCTCTT 


GCATCTATCG 


CTGCGGCGGC 


11350 


GGCGGAGTTT 


GGCCGTAGGT 


GGCGCCCTCT 


TCCTCCCATG 


CGTGTGACCC 


11400 


CGAAGCCCCT 


CATCGGCTGA 


AGCAGGG CT A 


GGTCGGCGAC 


AACGCGCTCG 


11450 


GCTAATATGG 


CCTGCTGCAC 


CTGCGTGAGG 


GTAGACTGGA 


AGTCATCCAT 


11500 


GTCCACAAAG 


CGGTGGTATG 


CGCCCGTGTT 


GATGGTGTAA 


GTGCAGTTGG 


11550 


CCATAACGGA 


CCAGTTAACG 


GTCTGGTGAC 


CCGGCTGCGA 


GAGCTCGGTG 


11600 


TACCTGAGAC 


GCGAGTAAGC 


CCTCGAGTCA 


AATACGTAGT 


CGTTGCAAGT 


11650 


CCGCACCAGG 


TACTGGTATC 


CCACCAAAAA 


GTGCGGCGGC 


GGCTGGCGGT 


11700 


AGAGGGGCCA 


GCGTAGGGTG 


GCCGGGGCTC 


CGGGGGCGAG 


ATCTTCCAAC 


11750 


ATAAGGCGAT 


GATATCCGTA 


GATGTACCTG 


GACATCCAGG 


TGATGCCGGC 


11800 


GGCGGTGGTG 


GAGGCGCGCG 


GAAAGTCGCG 


GACGCGGTTC 


CAGATGTTGC 


11850 


GCAGCGGCAA 


AAAGTGCTCC 


ATGGTCGGGA 


CGCTCTGGCC 


GGTCAGGCGC 


11900 


GCGCAATCGT 


TGACGCTCTA 


GACCGTGCAA 


AAGGAGAGCC 


TGTAAGCGGG 


11950 


CACTCTTCCG 


TGGTCTGGTG 


GATAAATTCG 


CAAGGGTATC 


ATGGCGGACG 


12000 
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ACCGGGGTTC 


GAGCCCCGTA 


TCCGGCCGTC 


CGCCGTGATC 


CATG CGGTTA 


12050 


CCGCCCGCGT 


GTCGAACCCA 


GGTGTGCGAC 


GTCAGACAAC 


GGGGGAGTGC 


12100 


TCCTTTT GGC 


TTCCTTCCAG 


GCGCGGCGGC 


TGCTGCGCTA 


gcttttttgg 


12150 


CCACTGGCCG 


CGCGCAGCGT 


AAGCGGTTAG 


GCTGGAAAGC 


GAAAG CATTA 


12200 


AGTGGCTCGC 


TCCCTGTAGC 


CGGAGGGTTA 


TTTTCCAAGG 


GTTGAGTCGC 


12250 


GGGACCCCCG 


GTTCGAGTCT 


CGGACCGGCC 


GGACTGCGGC 


GAACGGGGGT 


12300 


TTGCCTCCCC 


GTCATGCAAG 


ACCCCGCTTG 


CAAATTCCTC 


CGGAAACAGG 


12350 


GACGAGCCCC 


TTTTTTGCTT 


TTCCCAGATG 


CATCCGGTGC 


TGCGGCAGAT 


12400 


GCGCCCCCCT 


CCTCAGCAGC 


GGCAAGAGCA 


AGAGCAGCGG 


CAGACATGCA 


12450 


GGGC ACCCTC 


CCCTCCTCCT 


ACCGCGTCAG 


GAGGGGCGAC 


ATCCGCGGTT 


12500 


GACGCGGCAG 


CAGATGGTGA 


TTACGAACCC 


CCGCGGCGCC 


GGGCCCGGCA 


12550 


CTACCTGGAC 


TTGGAGGAGG 


GCGAGGGCCT 


GGCGCGGCTA 


GGAGCGCCCT 


12600 


CTCCTGAGCG 


GTACCCAAGG 


GTGCAGCTGA 


AGCGTGATAC 


GCGTGAGGCG 


12650 


TACGTGCCGC 


GGCAGAACCT 


GTTT CGCGAC 


CGCGAGGGAG 


AGGAGCCCGA 


12700 


GGAGATGCGG 


GATCGAAAGT 


TCCACGCAGG 


GCGCGAGCTG 


CGGCATGGCC 


12750 


TGAATCGCGA 


GCGGTTGCTG 


CGCGAGGAGG 


ACTTTGAGCC 


CGACGCGCGA 


12800 


AC CGGG ATT A 


GTCCCGCGCG 


CGCACACGTG 


GCGGCCGCCG 


ACCTGGTAAC 


12850 


CGCATACGAG 


CAGACGGTGA 


ACCAGGAGAT 


T AACTTT CAA 


AAAAGCTTTA 


12900 


ACAACCACGT 


GCGTACGCTT 


GTGGCGCGCG 


AGGAGGTGGC 


TATAGGACTG 


12950 


ATGCATCTGT 


GGGACTTTGT 


AAGCGCGCTG 


GAGCAAAACC 


CAAATAGCAA 


13000 


GCCGCTCATG 


GCGCAGCTGT 


TCCTTATAGT 


G CAG CACAGC 


AGGGACAACG 


13050 


AGGCATTCAG 


GGATGCGCTG 


CTAAACATAG 


TAGAGCCCGA 


GGGCCGCTGG 


13100 


CTGCTCGATT 


T GAT AAA CAT 


CCTGCAGAG C 


ATAGTGGTGC 


AGGAGCGCAG 


13150 


CTTGAGCCTG 


GCTGACAAGG 


TGGCCGCCAT 


CAACTATTCC 


ATGCTTAGCC 


13200 


TGGGCAAGTT . 


TTACGCCCGC 


AAGATATACC 


ATACCCCTTA 


CGTTCCCATA 


13250 


GACAAGGAGG 


TAAAGATCGA 


GGGGTTCTAC 


ATGCGCATGG 


CGCTGAAGGT 


13300 
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GCTTACCTTG 


AGCGACGACC 


TGGGCGTTTA 


TCGCAACGAG 


CGCATCCACA 


13350 


AGGCCGTGA G 


CGTGAGCCGG 


CGGCGCGAGC 


TCAGCGACCG 


CGAGCTGATG 


13400 


CACAGCCTGC 


AAAGGGCCCT 


GGCTGGCACG 


GGCAGCGGCG 


ATAGAGAGGC 


13450 


CGAGTCCTAC 


TTTGACGCGG 


GCGCTGACCT 


GCGCTGGGCC 


CCAAGCCGAC 


13500 


GCGCCCTGGA 


GGCAGCTGGG 


GCCGGACCTG 


GGCTGGCGGT 


GGCACCCGCG 


13550 


CGCGCTGGCA 


ACGTCGGCGG 


CGTGGAGGAA 


TATGACGAGG 


ACGATGAGTA 


13600 


CGAGCCAGAG 


GACGGCGAGT 


ACTAAGCGGT 


GATGTTTCTG 


ATCAGATGAT 


13650 


GCAAGACGCA 


ACGGACCCGG 


CGGTGCGGGC 


GGCGCTGCAG 


AGCCAGCCGT 


13700 


CCGGCCTTAA 


CTCCACGGAC 


GACTGGCGCC 


AGGTCATGGA 


CCGCATCATG 


13750 


TCGCTGACTG 


CGCGCAATCC 


TGACGCGTTC 


CGGCAGCAGC 


CGCAGGCCAA 


13800 


CCGGCTCTCC 


GCAATTCTGG 


AAGCGGTGGT 


CCCGGCGCGC 


GCAAACCCCA 


13850 


CGCACGAGAA 


GGTGCTGGCG 


ATCGTAAACG 


CGCTGGCCGA 


AAACAGGGCC 


13900 


ATCCGGCCCG 


ACGAGGCCGG 


CCTGGTCTAC 


GACGCGCTGC 


TTCAGCGCGT 


13950 


GGCTCGTTAC 


AACAGCGGCA 


ACGTGCAGAC 


CAACCTGGAC 


CGGCTGGTGG 


14000 


GGGATGTGCG 


CGAGGCCGTG 


GC G CAGCGTG 


AGCGCGCGCA 


GCAGCAGGGC 


14050 


AACCTGGGCT 


CCATGGTTGC 


ACTAAACGCC 


TTC CTGAGTA 


CACAGCCCGC 


14100 


CAACGTGCCG 


CGGGGACAGG 


AGGACTACAC 


CAACTTTGTG 


AGCGCACTGC 


14150 


GGCTAATGGT 


GACTGAGACA 


CCGCAAAGTG 


AGGTGTACCA 


GTCTGGGCCA 


14200 


GACTATTTTT 


TCCAGACCAG 


TAGACAAGGC 


CTGCAGACCG 


TAAACCTGAG 


14250 


CCAGGCTTTC 


AAAAACTTGC 


AGGGGCTGTG 


GGGGGTGCGG 


GCTCCCACAG 


14300 


GCGACCGCGC 


GACCGTGTCT 


AGCTTGCTGA 


CGCCCAACTC 


GCGCCTGTTG 


14350 


CTGCTGCTAA 


TAGCGCCCTT 


CACGGACAGT 


GGCAGCGTGT 


CGCGGGACAC 


14400 


ATACCTAGGT 


CACTTGCTGA 


CACTGTACCG 


CGAGGCCATA 


GGTCAGGCGC 


14450 


ATGTGGACGA 


GCATACTTTC 


CAGGAGATTA 


CAAGTGTCAG 


CCGCGCGCTG 


14500 


GGGCAGGAGG 


ACACGGGCAG 


CCTGGAGGCA 


ACCCTAAACT 


ACCTGCTGAC 


14550 


CAACCGGCGG 


CAGAAGATCC 


CCTCGTTGCA 


CAGTTTAAAC 


AGCGAGGAGG 


14600 
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AGCGCATTTT 

GACGGGGTAA 

ACCGGGCATG 

ACTTGCATCG 

TTGAACCCGC 

GGTGCCCGAG 

TGTTTTCCCC 

GCAGAGGCGG 

CGATCTAGGC 

GCTTGATAGG 

GGCGAGGAGG 

AAACCTGCCT 

AGATGAGTAG 

CCGCGCCCGC 

GTGGGAGGAC 

GGAGTGGCAA 

TAAAAAAAAA 

ACCGAGCGTT 

GTATGAGGAA 

CAGTGGCGGC 

TTTGTGCCTC 

TTACTCTGAG 

ACAACAAGTC 

AACTTTCTGA 



GCGCTACGTG 

CGCCCAGCGT 

TATGCCTCAA 

CGCGGCCGCC 

ACTGGCTACC 

GGTAACGATG 

GCAACCGCAG 

CGCTGCGAAA 

GCTGCGGCCC 

GTCTCTTACC 

AGTACCTAAA 

CCGGCATTTC 

ATGGAAGACG 

CCACCCGTCG 

GATGACTCGG 

CCCGTTTGCG 

AAAGCATGAT 

GGTTTTCTTG 

GGTCCTCCTC 

GGCGCTGGGT 

CGCGGTACCT 

TTGGCACCCC 

AACGGATGTG 

CCACGGTCAT 



85 

CAG C AG AG CG 

GGCGCTGGAC 

ACCGGCCGTT 

GTGAACCCCG 

GCCCCCTGGT 

GATTCCTCTG 

ACCCTGCTAG 

GGAAAGCTTC 

CGCGGTCAGA 

AGCACTCGCA 

CAACTCGCTG 

CCAACAACGG 

TACGCGCAGG 

TCAAAGGCAC 

CAGACGACAG 

CACCTTCGCC 

GCAAAATAAA 

TATTCCCCTT 

CCTCCTACGA 

TCTCCCTTCG 

GCGGCCTACC 

TATTCGACAC 

GCATCCCTGA 

TCAAAACAAT 



TGAGCCTTAA 

ATGACCGCGC 

TATCAACCGC 

AGTATTTCAC 

TTCTACACCG 

GGACGACATA 

AGTTGCAACA 

CGCAGGCCAA 

TGCTAGTAGC 

CCACCCGCCC 

CTGCAGCCGC 

GATAGAGAGC 

AGCACAGGGA 

GAC CGTCAGC 

CAGCGTCCTG 

CCAGGCTGGG 

AAACTCACCA 

AGTATGCGG C 

GAGTGTGGTG 

ATGCTCCCCT 

GGGGGGAGAA 

CACCCGTGTG 

ACTACCAGAA 

GACTACAGCC 



CCTGATGCGC 

GCAACATGGA 

CTAATGGACT 

CAATGC CATC 

GGGGATTCGA 

GACGACAGCG 

GCGCGAGCAG 

GCAGCTTGTC 

CCATTTCCAA 

GCGCCTGCTG 

AGCGCGAAAA 

CTAGTGGACA 

CGTGCCAGGC 

GGGGTCTGGT 

GATTTGGGAG 

GAGAATGTTT 

AGGCCATGGC 

GCGCGGCGAT 

AGCGCGGCGC 

GGACCCGCCG 

ACAGCATCCG 

TACCTGGTGG 

CGACCACAGC 

CGGGGGAGGC 



14650 

14700 

14750 

14800 

14850 

14900 

14950 

15000 

15050 

15100 

15150 

15200 

15250 

15300 

15350 

15400 

15450 

15500 

15550 

15600 

15650 

15700 

15750 

15800 

15850 



AAGCACACAG ACCATCAATC TTGACGACCG GTCGCACTGG GGCGGCGACC 
TGAAAACCAT CCTGCATACC AACATGCCAA ATGTGAACGA GTTCATGTTT 



15900 




WO 96/39530 



PCT/US96/1 0245 



86 

ACCAATAAGT TTAAGGCGCG GGTGATGGTG TCGCGCTTGC CTACTAAGGA 
CAATCAGGTG GAGCTGAAAT ACGAGTGGGT GGAGTTCACG CTGCCCGAGG 
GCAACTACTC CGAGACCATG ACCATAGACC TTATGAACAA CGCGATCGTG 
GAGCACTACT TGAAAGTGGG CAGACAGAAC GGGGTTCTGG AAAGCGACAT 
CGGGGTAAAG TTTGACACCC GCAACTTCAG ACTGGGGTTT GACCCCGTCA 
CTGGTCTTGT CATGCCTGGG GTATATACAA ACGAAGCCTT CCATCCAGAC 
ATCATTTTGC TGCCAGGATG CGGGGTGGAC TTCACCCACA GCCGCCTGAG 
CAACTTGTTG GGCATCCGCA AGCGGCAACC CTTCCAGGAG GGCTTTAGGA 
TCACCTACGA TGATCTGGAG GGTGGTAACA TTCCCGCACT GTTGGATGTG 
GACGCCTACC AGGCGAGCTT GAAAGATGAC ACCGAACAGG GCGGGGGTGG 
CGCAGGCGGC AGCAACAGCA GTGGCAGCGG CGCGGAAGAG AACTCCAACG 
CGGCAGCCGC GGCAATGCAG CCGGTGGAGG ACATGAACGA TCATGCCATT 
CGCGGCGACA CCTTTGCCAC ACGGGCTGAG GAGAAGCGCG CTGAGGCCGA 
AGCAGCGGCC GAAGCTGCCG CCCCCGCTGC GCAACCCGAG GTCGAGAAGC 
CTCAGAAGAA ACCGGTGATC AAACCCCTGA CAGAGGACAG CAAGAAACGC 
AGTTACAACC TAATAAGCAA TGACAGCACC TTCACCCAGT ACCGCAGCTG 
GTACCTTGCA TACAACTACG GCGACCCTCA GACCGGAATC CGCTCATGGA 
CCCTGCTTTG CACTCCTGAC GTAACCTGCG GCTCGGAGCA GGTCTACTGG 
TCGTTGCCAG ACATGATGCA AGACCCCGTG ACCTTCCGCT CCACGCGCCA 
GATCAGCAAC TTTCCGGTGG TGGGCGCCGA GCTGTTGCCC GTGCACTCCA 
AGAGCTTCTA CAACGACCAG GCCGTCTACT CCCAACTCAT CCGCCAGTTT 
ACCTCTCTGA CCCACGTGTT CAATCGCTTT CCCGAGAACC AGATTTTGGC 
GCGCCCGCCA GCCCCCACCA TCACCACCGfT CAGTGAAAAC GTTCCTGCTC 
TCACAGATCA CGGGACGCTA CCGCTGCGCA ACAGCATCGG AGGAGTCCAG 
CGAGTGACCA TTACTGACGC CAGACGCCGC ACCTGCCCCT ACGTTTACAA 
GGCCCTGGGC ATAGTCTCGC CGCGCGTCCT ATCGAGCCGC ACTTTTTGAG 



15950 

16000 

16050 

16100 

16150 

16200 

16250 

16300 

16350 

16400 

16450 

16500 

16550 

16600 

16650 

16700 

16750 

16800 

16850 

16900 

16950 

17000 

17050 

17100 

17150 



17200 
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CAAGCATGTC 


CATCCTTATA 


TCGCCCAGCA 


ATAACACAGG 


CTGGGGCCTG 


17250 


CGCTTCCCAA 


GCAAGATGTT 


TGGCGGGGCC 


AAGAAGCGCT 


CCGACCAACA 


17300 


CCCAGTGCGC 


GTGCGCGGGC 


ACTACCGCGC 


GCCCTGGGGC 


GCGCACAAAC 


17350 


GCGGCCGCAC 


TGGGCGCACC 


ACCGTCGATG 


ACGCCATCGA 


CGCGGTGGTG 


17400 


GAGGAGGCGC 


GCAACTACAC 


GCCCACGCCG 


CCACCAGTGT 


CCACAGTGGA 


17450 


CGCGGCCATT 


CAGACCGTGG 


TGCGCGGAGC 


CCGGCGCTAT 


GCTAAAATGA 


17500 


AGAGACGGCG 


GAGGCGCGTA 


GCACGTCGCC 


ACCGCCGCCG 


ACCCGGCACT 


17550 


GCCGCCGAAC 


GCGCGGCGGC 


GGCCCTGCTT 


AACCGCGCAC 


GTCGCACCGG 


17600 


CCGACGGGCG 


GCCATGCGGG 


CCGCTCGAAG 


GCTGGCCGCG 


GGTATTGTCA 


17650 


CTGTGCCCCC 


CAGGTCCAGG 


CGACGAGCGG 


CCGCCGCAGC 


AGCCGCGGCC 


17700 


ATTAGTG CTA 


TGACTCAGGG 


TCGCAGGGGC 


AACGTGTATT 


GGGTGCGCGA 


17750 


CTCGGTTAGC 


GGCCTGCGCG 


TGCCCGTGCG 


CACCCGCCCC 


CCGCGCAACT 


17800 


AGATTGCAAG 


AAAAAACTAC 


TTAGACTCGT 


ACTGTTGTAT 


GTATCCAGCG 


17850 


GCGGCGGCGC 


GCAACGAAGC 


TATGT CCAAG 


CGCAAAATCA 


AAGAAGAGAT 


17900 


GCTCCAGGTC 


ATCGCGCCGG 


AGATCTATGG 


CCCCCCGAAG 


AAGGAAGAGC 


17950 


AGGATTACAA 


GCCCCGAAAG 


CTAAA GCGGG 


TCAAAAAGAA 


AAAGAAAGAT 


18000 


GATGATGATG 


AACTTGACGA 


CGAGGTGGAA 


CTGCTGCACG 


GTACCGCGCC 


18050 


CAGGCGACGG 


GTACAGTGGA 


AAGGTCGACG 


CGTAAAACGT 


GTTTTGCGAC 


18100 


CCGGCACCAC 


CGTAGTCTTT 


ACGCCCGGTG 


AGCGCTCCAC 


CCGCACCTAC 


18150 


AAGCGCGTGT 


ATGATGAGGT 


GTACGGCGAC 


GAGGACCTGC 


TTGAGCAGGC 


18200 


CAACGAGCGC 


CTCGGGGAGT 


TTGCCTACGG 


AAAGCGG CAT 


AAGGACATGC 


18250 


TGGCGTTGCC 


GCTGGACGAG 


GGCAACCCAA 


CACCTAGCCT 


AAAGCCCGTA 


18300 


ACACTGCAGC 


AGGTGCTGCC 


CGCGCTTGCA 


CCGTCCGAAG 


AAAAGCGCGG 


18350 


CCTAAAGCGC 


GAGTCTGGTG 


ACTTGGCACC 


CACCGTGCAG 


CTGATGGTAC 


18400 


CCAAGCGCCA 


GCGACTGGAA 


GATGTCTTGG 


AAAAAATGAC 


CGTGGAACCT 


18450 


GGGCTGGAGC 


CCGAGGTCCG 


CGTGCGGCCA 


ATCAAGCAGG 


TGGCGCCGGG 


18500 
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ACTGGGCGTG CAGACCGTGG ACGTTCAGAT ACCCACTACC AGTAGCACCA 
GTATTGCCAC CGCCACAGAG GGCATGGAGA CACAAACGTC CCCGGTTGCC 
TCAGCGGTGG CGGATGCCGC GGTGCAGGCG GTCGCTGCGG CCGCGTCCAA 
GACCTCTACG GAGGTGCAAA CGGACCCGTG GATGTTTCGC GTTTCAGCCC 
CCCGGCGCCC GCGCGGTTCG AGGAAGTACG GCGCCGCCAG CGCGCTACTG 
CCCGAATATG CCCTACATCC TTCCATTGCG CCTACCCCCG GCTATCGTGG 
CTACACCTAC CGCCCCAGAA GACGAGCAAC TACCCGACGC CGAACCACCA 
CTGGAACCCG CCGCCGCCGT CGCCGTCGCC AGCCCGTGCT GGCCCCGATT 
TCCGTGCGCA GGGTGGCTCG CGAAGGAGGC AGGACCCTGG TGCTGCCAAC 
AGCGCGCTAC CACCCCAGCA TCGTTTAAAA GCCGGTCTTT GTGGTTCTTG 
CAGATATGG C CCTCACCTGC CGCCTCCGTT TCCCGGTGCC GGGATTCCGA 
GGAAGAATGC ACC GT AG GAG GGGCATGGCC GGCCACGGCC TGACGGGCGG 
CATGCGTCGT GCGCACCACC GGCGGCGGCG CGCGTCGCAC CGTCGCATGC 
GCGGCGGTAT CCTGCCCCTC CTTATTCCAC TGATCGCCGC GGCGATTGGC 
GCCGTGCCCG G AATTG CAT C CGTGGCCTTG CAGGCGCAGA GACACTGATT 
AAAAACAAGT TGCATGTGGA AAAATCAAAA TAAAAAGTCT GGACTCTCAC 
GCTCGCTTGG TCCTGTAACT ATTTTGTAGA ATGGAAGACA TCAACTTTGC 
GTCTCT GGCC CCGCGACACG GCTCGCGCCC GTTCATGGGA AACTGGCAAG 
ATATCGGCAC CAGCAATATG AGCGGTGGCG CCTTCAGCTG GGGCTCGCTG 
TGGAGCGGCA TTAAAAATTT CGGTTCCACC GTTAAGAACT ATGGCAGCAA 
GGCCTGGAAC AGCAGCACAG GCCAGATGCT GAGGGATAAG TTGAAAGAGC 
AAAATTTCCA ACAAAAGGTG GTAGATGGCC TGGCCTCTGG CATTAGCGGG 
GTGGTGGACC TGGCCAACCA GGCAGTGCAA AATAAGATTA ACAGTAAGCT 
TGATCCCCGC CCTCCCGTAG AGGAGCCTCC ACCGGCCGTG GAGACAGTGT 
CTCCAGAGGG GCGTGGCGAA AAGCGTCCGC GCCCCGACAG GGAAGAAACT 
CTGGTGACGC AAATAGACGA GCCTCCCTCG TACGAGGAGG CACTAAAGCA 
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AGGCCTGCCC 
GCCAGCACAC 
CAGAAACCTG 
CCGCGCGTCC 
TAGCCAGTGG 
GTGCAATCCC 
TGTGTGTCAT 
CGCGCGCCCG 
CTTACATGCA 
CTGGTG CAGT 
GTTTAGAAAC 
CCCAGCGTTT 
TACTCGTACA 
GGACATGGCT 
CTACTTTTAA 
GGTGCCCCAA 
AAACCTAGAA 
CTGAGCAGCA 
AATATTACAA 
ATATGCCGAT 
GGTACGAAAC 
ACCCCAATGA 
TGGAGGGCAA 
AAGTGGAAAT 
GATAACTTGA 
AACCCCAGAC 



ACCACCCGTC 
ACCCGTAACG 
TGCTGCCAGG 
CTGCGCCGCG 
CAACTGGCAA 
TGAAGCGCCG 
GTATGCGTCC 
CTTTCCAAGA 
CATCTCGGGC 
TTGCCCGCGC 
CCCACGGTGG 
GACGCTGCGG 
AGGCGCGGTT 
T C C ACGTACT 
GCCCTACTCT 
ATCCTTGCGA 
GAAGAGGACG 
AAAAACTCAC 
AGGAGGGTAT 
AAAACATTTC 
TGAAATTAAT 
AACCATGTTA 
GGCATTCTTG 
GCAATTTTTC 
CTCCTAAAGT 
ACTCATATTT 
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CCATCGCGCC 
CTGGACCTGC 
CCCGACCGCC 
CCGCCAGCGG 
AGCACACTGA 
ACGATGCTTC 
ATGTCGCCGC 
TGGCTACCCC 
C AGGACG CCT 
CACCGAGACG 
CGCCTACGCA 
TTCATCCCTG 
CACCCTAGCT 
TTGACATCCG 
GGCACTGCCT 
ATGGGATGAA 
ATGACAACGA 
GTATTTGGGC 
TCAAATAGGT 
AACCTGAACC 
CATGCAGCTG 
CGGTT CAT AT 
TAAAG C AAC A 
TCAACTACTG 
GGTATTGTAC 
CTTACATGCC 



CATGGCTACC 
CTCCCCCCGC 
GTTGTTGTAA 
TCCGCGATCG 
ACAGCATCGT 
TGAATAGCTA 
C AG AGG AG CT 
TTCGATGATG 
CGGAGTACCT 
TACTTCAGCC 
CGACGTGACC 
TGGACCGTGA 
GTGGGTGATA 
CGGCGTGCTG 
ACAACGCCCT 
GCTGCTACTG 
AGACGAAGTA 
AGGCGCCTTA 
GTCGAAGGTC 
TCAAATAGGA 
GGAGAGTCCT 
GCAAAACCCA 
AAATGGAAAG 
AGGCGACCGC 
AGTGAAGATG 
CACTATTAAG 



GGAGTGCTGG 
CGACACCCAG 
CCCGTCCTAG 
TTGCGGCCCG 
GGGTCTGGGG 
ACGTGTCGTA 
GCTGAGCCGC 
CCGCAGTGGT 
GAGCCCCGGG 
TGAATAACAA 
ACAGACCGGT 
GGATACTGCG 
ACCGTGTGCT 
GACAGGGGCC 
GGCTCCCAAG 
CTCTTGAAAT 
GACGAGCAAG 
TTCTGGTATA 
AAACACCTAA 
GAATCTCAGT 
TAAAAAGACT 
CAAATGAAAA 
CTAGAAAGTC 
AGG CAATGGT 
TAGATATAGA 
GAAGGTAACT 
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CACGAGAACT 

GCTTTTAGGG 

TATGGGTGTT 

TGCAAGACAG 

GGTGATAGAA 

TGATCCAGAT 

CAAATTACTG 

AAGGTAAAAC 

AGAATTTTCA 

AAATCAATCT 

GCGCTGTATT 

TTCTGATAAC 

CCGGGTTAGT 

TATATGGACA 

CTACCGCTCA 

AGGTGCCTCA 

TCATACACCT 

GAGCTCCCTA 

ATAGCATTTG 

TCCACGCTTG 

CGACTATCTC 

CCAACGTGCC 

TGGGCCTTCA 

CTACGACCCT 

CCTTTTACCT 

TCTGTCAGCT 



AATGGGCCAA 

ACAATTTTAT 

CTGGCGGGCC 

AAACACAGAG 

CCAGGTACTT 

GTTAGAATTA 

CTTTCCACTG 

CTAAAACAGG 

GATAAAAATG 

AAATGCCAAC 

TGCCCGACAA 

CCAAACACCT 

GGACTGCTAC 

ACGTCAACCC 

ATGTTGCTGG 

GAAGTTCTTT 

ACGAGTGGAA 

GGAAATGACC 

CCTTTACGCC 

AGGCCATGCT 

TCCGCCGCCA 

CATATCCATC 

CGCGCCTTAA 

TATTACAC CT 

CAACCACACC 

GGCCTGGCAA 
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CAATCTATGC 

TGGTCTAATG 

AAGCATCGCA 

CTTTCATACC 

TTCTATGTGG 

TTGAAAATCA 

GGAGGTGTGA 

T CAGGAAAAT 

AAATAAGAGT 

CTGTGGAGAA 

GCTAAAGTAC 

ACG A CTACAT 

ATTAACCTTG 

ATTTAACCAC 

GCAATGGTCG 

GCCATTAAAA 

CTTCAGGAAG 

TAAGGGTTGA 

ACCTTCTTCC 

TAGAAACGAC 

ACATGCTCTA 

CCCTCCCGCA 

GACTAAGGAA 

ACTCTGGCTC 

TTTAAGAAGG 

TGACCGCCTG 



CCAACAGGCC 

TATTACAACA 

GTTGAATGCT 

AGCTTTTGCT 

AATCAGGCTG 

TGGAACTGAA 

TTAATACAGA 

GGATGGGAAA 

TGGAAATAAT 

ATTTCCTGTA 

AGTCCTTCCA 

GAAGAAGCGA 

GAGCACGCTG 

CACCGCAATG 

CTATGTG CCC 

ACCTCCTTCT 

GATGTTAACA 

CGGAGCCAGC 

CCATGGCCCA 

ACCAACGACC 

CCCTATACCC 

ACTGGGCGGC 

ACCCCATCAC 

TATACCCTAC 

TGGCCATTAC 

CTTACCCCCA 



TAATTACATT 

GCACGGGTAA 

GTTGTAGATT 

TGATTCCATT 

TTGACAGCTA 

GATGAACTTC 

GACTCTTACC 

AAGATGCTAC 

TTTGCCATGG 

CTCCAACATA 

ACGTAAAAAT 

GTGGTGGCTC 

GTCCCTTGAC 

CTGGCCTGCG 

TTCCACATCC 

CCTGCCGGGC 

TGGTTCTGCA 

ATTAAGTTTG 

CAACACCGCC 

AGTCCTTTAA 

GCCAACGCTA 

TTTCCGCGGC 

TGGGCTCGGG 

CTAGATGGAA 

CTTTGACTCT 

ACGAGTTTGA 
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AATTAAGCGC TCAGTTGACG GGGAGGGTTA 
TGACCAAAGA CTGGTTCCTG GTACAAATGC 
TACCAGGGCT TCTATATCCC AGAGAGCTAC 
CTTTAGAAAC TTCCAGCCCA TGAGCCGTCA 
ACAAGGACTA CCAACAGGTG GGCATCCTAC 
TTTGTTGGCT ACCTTGCCCC CACCATGCGC 
TAACTTCCCC TATCCGCTTA TAGGCAAGAC 
AGAAAAAGTT TCTTTGCGAT CGCACCCTTT 
AACTTTATGT CCATGGGCGC ACTCACAGAC 
CGCCAACTCC GCCCACGCGC TAGACATGAC 
ACGAGCCCAC CCTTCTTTAT GTTTTGTTTG 
GTGCACCGGC CGCACCGCGG CGTCATCGAA 
CTTCTCGGCC GGCAACGCCA CAACATAAAG 
AGCTGCCGCC ATGGGCTCCA GTGAGCAGGA 
ATCTTGGTTG TGGGCCATAT TTTTTGGGCA 
GGCTTTGTTT CTCCACACAA GCTCGCCTGC 
TCGCGAGACT GGGGGCGTAC ACTGGATGGC 
CAAAAACATG CTACCTCTTT GAGCCCTTTG 
AAGCAGGTTT AC CAGTTTGA GTACGAGTCA 
TGCTTCTTCC CCCGACCGCT GTATAACGCT 
TACAGGGGCC CAACTCGGCC GCCTGTGGAC 
CACGCCTTTG CCAACTGGCC CCAAACTCCC 
GAACCTTATT ACCGGGGTAC CCAACTCCAT 
AGCCCACCCT GCGTCGCAAC CAGGAACAGC 
CACTCGCCCT ACTTCCGCAG CCACAGTGCG 
TTTTTGTCAC TTGAAAAACA TGTAAAAATA 



CAACGTTGCC CAGTGTAACA 22450 
TAGCTAACTA CAACATTGGC 22500 
AAGGACCGCA TGTACTCCTT 22550 
GGTGGTGGAT GATACTAAAT 22600 
ACCAACACAA C AACT CTGGA 22650 
GAAGGACAGG CCTACCCTGC 22700 
CGCAGTTGAC AGCATTACCC 22750 
GGCGCATCCC ATTCTCCAGT 22800 
CTGGGCCAAA ACCTTCTCTA 22850 
TTTTGAGGTG GATCCCATGG 22900 
AAGTCTTTGA CGTGGTCCGT 22950 
ACCGTGTACC TGCGCACGCC 23000 
AAGCAAGCAA CATCAACAAC 23050 
ACTGAAAGCC ATTGTCAAAG 23100 
CCTATGACAA GCGCTTTCCA 23150 
GCCATAGTCA ATACGGCCGG 23200 
CTTTGCCTGG AACCCGCACT 23250 
GCTTTTCTGA CCAGCGACTC 23300 
CTCCTGCGCC GTAGCGCCAT 23350 
GGAAAAGTCC ACCCAAAGCG 23400 
TATTCTGCTG CATGTTTCTC 23450 
ATGGATCACA ACCCCACCAT 23500 
GCTCAACAGT CCCCAGGTAC 23550 
TCTACAGCTT CCTGGAGCGC 23600 
CAGATTAGGA GCGCCACTTC 23650 
ATGTACTAGA GACACTTTCA 23700 
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ATAAAGGCAA ATGCTTTTAT TTGTACACTC TCGGGTGATT ATTTACCCCC 
ACCCTTGCCG TCTGCGCCGT TTAAAAATCA AAGGGGTTCT GCCGCGCATC 
GCTATGCGCC ACTGGCAGGG ACACGTTGCG ATACTGGTGT TTAGTGCTCC 
ACTTAAACTC AGGCACAACC ATCCGCGGCA GCTCGGTGAA GTTTTCACTC 
CACAGGCTGC GCACCATCAC CAACGCGTTT AGCAGGTCGG GCGCCGATAT 
CTTGAAGTCG CAGTTGGGGC CTCCGCCCTG CGCGCGCGAG TTGCGATACA 
CAGGGTTGCA GCACTGGAAC ACTATCAGCG CCGGGTGGTG CACGCTGGCC 
AGCACGCTCT TGTCGGAGAT CAGATCCGCG TCCAGGTCCT CCGCGTTGCT 
CAGGGCGAAC GGAGTCAACT TTGGTAGCTG CCTTCCCAAA AAGGGCGCGT 
GCCCAGGCTT TGAGTTGCAC TCGCACCGTA GTGGCATCAA AAGGTGACCG 
TGCCCGGTCT GGGCGTTAGG ATACAGCGCC TGCATAAAAG CCTTGATCTG 
CTTAAAAGCC ACCTGAGCCT TTGCGCCTTC AGAGAAGAAC ATGCCGCAAG 
ACTTGCCGGA AAACTGATTG GCCGGACAGG CCGCGTCGTG CACGCAGCAC 
CTTGCGTCGG TGTTGGAGAT CTGCACCACA TTTCGGCCCC ACCGGTTCTT 
CACGATCTTG GCCTTGCTAG ACTGCTCCTT CAGCGCGCGC TGCCCGTTTT 
CGCTCGTCAC ATCCATTTCA ATCACGTGCT CCTTATTTAT CATAATGCTT 
CCGTGTAGAC ACTTAAGCTC GCCTTCGATC TCAGCGCAGC GGTGCAGCCA 
CAACGCGCAG CCCGTGGGCT CGTGATGCTT GTAGGTCACC TCTGCAAACG 
ACTGCAGGTA CGCCTGCAGG AATCGCCCCA TCATCGTCAC AAAGGTCTTG 
TTGCTGGTGA AGGTCAGCTG CAACCCGCGG TGCTCCTCGT TCAGCCAGGT 
CTTGCATACG GCCGCCAGAG CTTCCACTTG GTCAGGCAGT AGTTTGAAGT 
TCGCCTTTAG AT CGTTAT C C ACGTGGTACT TGTCCATCAG CGCGCGCGCA 
GCCTCCATGC CCTTCTCCCA CGCAGACACG ATCGGCACAC TCAGCGGGTT 
CATCACCGTA ATTTCACTTT CCGCTTCGCT GGGCTCTTCC TCTTCCTCTT 
GCGTCCGCAT ACCACGCGCC ACTGGGTCGT CTTCATTCAG CCGCCGCACT 
GTGCGCTTAC CTCCTTTGCC ATGCTTGATT AGCACCGGTG GGTTGCTGAA 
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ACCCACCATT 

TTACCTCTGG 

TTCTTCTTGG 

GCTGGGTGTG 

CGGACTCGAT 

GGCGGCGACG 

CGCCGCACCG 

GACTGGCCAT 

GAGAAGAAGG 

CACCGATGCC 

TTGAGGAGGA 

GACGACGAGG 

CAACGCAGAG 

GCGACTACCT 

CAGTGCGCCA 

CGCCATAGCG 

GCGTACCCCC 

CTCAACTTCT 

CATCTTTTTC 

GCCGAGCGGA 

ATCGCCTCGC 

CGAGAAGCGC 

GTCACTCTGG 

GTACTAAAAC 

CCTACCCCCC 

GTGCGCAGCC 



TGTAGCGCCA 
TGATGGCGGG 
GCGCAATGGC 
CGCGGCACCA 
ACGCCGCCTC 
GGGACGGGGA 
CGTCCGCGCT 
TTCCTTCTCC 
ACAGCCTAAC 
GCCAACGCGC 
GGAAGTGATT 
ACCGCTCAGT 
GCAAACGAGG 
AGATGTGGGA 
TTATCTGCGA 
GATGTCAGCC 
CAAACGC CAA 
ACCCCGTATT 
CAAAACTGCA 
CAAGGAGCTG 
TCAACGAAGT 
GCGGGAAACG 
AGTGTTGGTG 
GCAGCATCGA 
AAGGTCATGA 
CCTGGAGAGG 
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CATCTTCTCT 
CGCTCGGGCT 
CAAATCCGCC 
GCGCGTCTTG 
ATCCGCTTTT 
CGACACGTCC 
CGGGGGTGGT 
TATAGGCAGA 
CGCCCCCTCT 
CTACCACCTT 
ATCGAGCAGG 
ACCAACAGAG 
AACAAGTCGG 
GACGACGTG C 
CGCGTTGCAA 
TTGCCTACGA 
GAAAACGGCA 
TGCCGTGCCA 
AGATACCCCT 
GCCTTGCGGC 
GCCAAAAATC 
CTCTGCAACA 
GAACTCGAGG 
GGTCACCCAC 
GCACAGTCAT 
GATGCAAATT 



TTCTTCCTCG 

TGGGAGAAGG 

GCCGAGGTCG 

TGATGAGTCT 

TTGGGGGCGC 

TCCATGGTTG 

TTCGCGCTGC 

AAAAGATCAT 

GAGTTCGCCA 

CCCCGTCGAG 

ACCCAGGTTT 

GATAAAAAGC 

GCGGGGGGAC 

TGTTGAAGCA 

GAGCGCAGCG 

ACGCCACCTA 

CATGCGAGCC 

GAGGTGCTTG 

ATCCTGCCGT 

AGGGCGCTGT 

TTTGAGGGTC 

GGAAAACAGC 

GTGACAACGC 

tttgcctacc 

GAGTGAGCTG 

TGCAAGAACA 



CTGTCCACGA 
GCGCTTCTTT 
ATGGCCGCGG 
TCCTCGTCCT 
CCGGGGAGGC 
GGGGACGTCG 
TCCTCTTCCC 
GGAGTCAGTC 
CCACCGCCTC 
GCACCCCCGC 
TGTAAGCGAA 
AAGACCAGGA 
GAAAGGCATG 
TCTGCAGCGC 
ATGTGCCCCT 
TTCTCA CCGC 
CAACCCGCGC 
CCACCTATCA 
GCCAACCGCA 
CATACCTGAT 
TTGGACGCGA 
GAAAATGAAA 
GCGCCTAGCC 
CGGCACTTAA 
ATCGTGCGCC 
AACAGAGGAG 
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GGCCTACCCG 

CGAGCCTGCC 

TCGTTACCGT 

ATGCAGCGCA 

CGTACGCCAG 

CCTACCTTGG 

TCCACGCTCA 

CTTATTTCTA 

G CTTGGAGGA 

TTGAAGGACC 

GGCGGACATC 

TGCCAGACTT 

CTAGAGCGCT 

CTTTGTGCCC 

GCTACCTTCT 

GAAGACGTGA 

ATGCACCCCG 

GTCAAATTAT 

TCCGCGGCTC 

CCTTCGCAAA 

ACGAAGACCA 

ACCCAGGGCC 

AGAGTTTCTG 

GCGAGGAGCT 

CCGCGGGCCC 

CGCCGCCACC 



CAGTTGGCGA 

GACTTGGAGG 

GGAGCTTGAG 

AGCTAGAGGA 

GCCTGCAAGA 

AATTTTGCAC 

AGGGCGAGGC 

TGCTACACCT 

GTGCAACCTC 

TATGGACGGC 

ATTTTCCCCG 

CAC C AGTCAA 

CAGGAATCTT 

ATTAAGTACC 

GCAGCTAGCC 

GCGGTGACGG 

CACCGCTCCC 

CGGTACCTTT 

CGGGGTTGAA 

TTTGTACCTG 

AT CCCGCCCG 

ACATTCTTGG 

CTACGAAAGG 

CAACCCAATC 

TTGCTTCCCA 

CACGGACGAG 



94 

CGAG CAGCTA 

AGCGACGCAA 

TGCATGCAGC 

AACATTGCAC 

TCTCCAACGT 

GAAAACCGCC 

GCGCCGCGAC 

GGCAGACGGC 

AAGGAGCTGC 

CTTCAACGAG 

AACGCCTGCT 

AGCATGTTGC 

GCCCGCCACC 

GCGAATGCCC 

AACTACCTTG 

TCTACTGGAG 

TGGTTTGCAA 

GAGCTGCAGG 

ACTCACTCCG 

AGGACTACCA 

CCAAATGCGG 

CCAATTGCAA 

GACGGGGGGT 

CCCCCGCCGC 

GGATGGCACC 

GAGGAATACT 



GCGCGCTGGC 

ACTAATGATG 

GGTTCTTTGC 

TACACCTTTC 

GGAGCTCTGC 

TTGGGCAAAA 

TACGTCCGCG 

CATGGGCGTT 

AGAAACTGCT 

CGCTCCGTGG 

TAAAACCCTG 

AGAACTTTAG 

TGCTGTGCAC 

TCCGCCGCTT 

CCTACCACTC 

TGTCACTGTC 

TTCGCAGCTG 

GTCCCTCGCC 

GGGCTGTGGA 

CGCCCACGAG 

AGCTTACCGC 

GCCAT CAACA 

TTACTTGGAC 

CGCAGCCCTA 

CAAAAAGAAG 

GGGACAGT CA 



TTCAAACGCG 

GCCGCAGTGC 

TGACCCGGAG 

GACAGGGCTA 

AACCTGGTCT 

CGTGCTTCAT 

ACTGCGTTTA 

TGGCAGCAGT 

AAAGCAAAAC 

CCGCGCACCT 

CAACAGGGTC 

GAACTTTATC 

TTCCTAGCGA 

TGGGGCCACT 

TGACATAATG 

GCTGCAACCT 

CTTAACGAAA 

TGACGAAAAG 

CGTCGGCTTA 

ATTAGGTTCT 

CTG CGTC ATT 

AAGCCCGCCA 

CCCCAGTCCG 

TCAGCAGCAG 

CTGCAGCTGC 

GGCAGAGGAG 
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GTTTTGGACG 
CGAGGAAG CT 
CGGTCGCATT 
ATGGCTACAA 
ACCCAACCGT 
AGCCGCCGCC 
TGGCGCGGGC 
CAACATCTCC 
TCCCCCGTAA 
ACCGGCGGCA 
GGCGACCGGA 
GCAGCAGCAG 
GACCCGCGAG 
AACAGAGCAG 
CGATCCCTCA 
GCGCACGCTG 
CTCTTAAGGA 
CGTCATCTCC 
TATGAGCAAG 
TGGGACTTGC 
ATGAGCGCGG 
CCGAAACCGA 
ATAACCTTAA 
CCCGCTCCCA 
GATGACTAAC 
GGTCGCCCGG 



AGGAGGAGGA 
TCCGAGGTCG 
CCCCTCGCCG 
CCTCCGCTCC 
AGATGGGACA 
GTTAGCCCAA 
ACAAGAACGC 
TTCGCCCGCC 
CATC CTGCAT 
GCGGCAGCGG 
TAG C AAGACT 
GAGGAGGAGC 
CTTAGAAACA 
GGGCCAAGAA 
CCCGCAGCTG 
GAAGACGCGG 
CTAGTTTCGC 
AGCGGCCACA 
GAAATTCCCA 
GGCTGGAGCT 
G AC C C C AC AT 
ATTCTCTTGG 
TCCCCGTAGT 
CCACTGTGGT 
TCAGGGGCGC 
GCAGGGTATA 



95 

GGACATGATG 
AAGAGGTGTC 
GCGCCCCAGA 
TCAGGCGCCG 
CCACTGGAAC 
GAGCAACAAC 
CATAGTTGCT 
GCTTTCTTCT 
TACTACCGTC 
CAGCAACAGC 
CTGACAAAGC 
GCTGCGTCTG 
GGATTTTTCC 
CAAGAGCTGA 
CCTGTATCAC 
AGGCTCTCTT 
GCCCTTTCTC 
CCCGGCGCCA 
CGCCCTACAT 
GCCCAAGACT 
GAT AT C CCGG 
AACA GGCGGC 
TGGCCCGCTG 
ACTTCCCAGA 
AGCTTGCGGG 
ACTCACCTGA 



GAAGACTGGG 

AGACGAAACA 

AATCGGCAAC 

CCGGCACTGC 

CAGGGCCGGT 

AGCGCCAAGG 

TGCTTGCAAG 

CTACCATCAC 

ATCTCTACAG 

AGCGGCCACA 

CCAAGAAATC 

GCGCCCAACG 

CACTCTGTAT 

AAATAAAAAA 

AAAAGCGAAG 

CAGTAAATAC 

AAATTTAAGC 

GCACCTGTCG 

GTGGAGTTAC 

ACTCAACCCG 

GTCAACGGAA 

TATTACCACC 

CCCTGGTGTA 

GACGCCCAGG 

CGGCTTTCGT 

CAATCAGAGG 



AGAGCCTAGA 
CCGTCACCCT 
CGGTTCCAGC 
CCGTTCGCCG 
AAGTCCAAGC 
CTAC CGCTCA 
ACTGTGGGGG 
GGCGTGGCCT 
CCCATACTGC 
CAGAAGCAAA 
CACAGCGGCG 
AACCCGTATC 
GCTATATTTC 
CAGGTCTCTG 
ATCAGCTTCG 
TGCGCGCTGA 
GCGAAAACTA 
TCAGCGCCAT 
CAGCCACAAA 
AATAAACTAC 
TCCGCGCCCA 
ACAC CTCGTA 
CCAGGAAAGT 
CCGAAGTTCA 
CACAGGGTGC 
GCGAGGTATT 



27650 

27700 

27750 

27800 

27850 

27900 

27950 

28000 

28050 

28100 

28150 

28200 

28250 

28300 

28350 

28400 

28450 

28500 

28550 

28600 

28650 

28700 

28750 

28800 

28850 



28900 




WO 96/39530 



PCT/US96/ 10245 



CAG CTCAACG 

GACATTTCAG 

CAATCCTAAC 

GGAACTCTGC 

CTTCTCGGGA 

ACGCGGTAAA 

GCAGAGCAAC 

CTTTGCCCGC 

ATATCGAGGG 

GCCCGTAGCC 

GGACAGGGGA 

GATTACATCA 

AGAAATTAAA 

TCTTCACCCG 

ATCTCTCCCT 

ACGAGAGAAC 

TCCTTACCTG 

CTACCGCCTG 

TGTTTACCAG 

AAAGGCGCAG 

CTATTCTAAT 

TTGTGATTCT 

GCCTGCTGTG 

TCGCCACCCA 

GTCAGCCCAC 

ATGTTACATT 



ACGAGTCGGT 

ATCGGCGGCG 

TCTGCAGACC 

AATTTATTGA 

CCTCCCGGCC 

GGACTCGGCG 

TGCGCCTGAA 

GACTCCGGTG 

CCCGGCGCAC 

TGATTCGGGA 

CCCTGTGTTC 

AGATCTTTGT 

ATATACTGGG 

CCCAAGCAAA 

CTGTGATTTA 

CTCTCCGAGC 

CCGGGAACGT 

ACCGTAAACC 

AACAGGAGGT 

CTACTGTGGG 

TCAGGTTTCT 

CTTTATTCTT 

TGCACATTTG 

AGATGATTAG 

GGTACCACCC 

CGCAGCTGAA 



96 

GAGCTCCTCG 

CCGGCCGTCC 

TCGTCCTCTG 

GGAGTTTGTG 

ACTAT CCGGA 

GACGGCTACG 

ACACCTGGTC 

AGTTTTGCTA 

GGCGTCCGGC 

GTTTACCCAG 

TCACTGTGAT 

TGCCATCTCT 

GCTCCTATCG 

CCAAGGCGAA 

CAACAGTTTC 

TCAGCTACTC 

ACGAGTGCGT 

AGACTTTTTC 

GAGCTTAGAA 

GTTTATGAAC 

CTAGAATCGG 

ATACTAACGC 

CATTTATTGT 

GTACATAATC 

AAAAGGTGGA 

GCTAATGAGT 



CTTGGTCTCC 

TTCATTCACG 

AGCCGCGCTC 

CCATCGGTCT 

TCAATTTATT 

ACTGAATGTT 

CACTGTCGCC 

CTTTGAATTG 

TTACCGCCCA 

CGCCCCCTGC 

TTGCAACTGT 

GTGCTGAGTA 

CCATCCTGTA 

CCTTACCTGG 

AACCCAGACG 

CATCAGAAAA 

CACCGGCCGC 

CGGACAGACC 

AACCCTTAGG 

AATTCAAGCA 

GGTTGGGGTT 

TTCTCTGCCT 

CAGCTTTTTA 

CTAGGTTTAC 

TTTTAAGGAG 

GCACCACTCT 



GTCCGGACGG 

CCTCGTCAGG 

TGGAGGCATT 

ACTTTAACCC 

CCTAACTTTG 

AAGTGGAGAG 

GCCACAAGTG 

CCCGAGGATC 

GGGAGAGCTT 

TAGTTGAGCG 

CCTAACCTTG 

TAATAAATAC 

AACGCCACCG 

TACTTTTAAC 

GAGTGAGTCT 

AACACCACCC 

TGCACCACAC 

TCAATAACTC 

GTATTAGGCC 

ACT CT ACGGG 

ATTCTCTGTC 

AAGGCTCGCC 

AACGCTGGGG 

TCACCCTTGC 

CCAGCCTGTA 

TATAAAATGC 
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ACCACAGAAC 

GTATGCTGTT 

TTACAGTTTT 

TTTTATGAAA 

GTGGCCCCCA 

CTATGCTAAT 

TACAAAAGCA 

TAAGTTACAA 

AACAAATTCA 

GGTCATTTCC 

ATATGCTCCA 

CTGACTTTGG 

CGACCCACCC 

CGGACTTACA 

ACTGGGATAA 

TGCCTTATTA 

ACCACCCATC 

TCCATAGATT 

TTAAATGAGA 

CTTTTTTGTG 

CTGCATTCCA 

CGCTCATCTG 

GACTGGGTCT 

GGACAGGACT 

TGTGACTTTT 

CCTCCAAGCC 



ATGAAAAGCT 
TATGCTATTT 
CCAGGGTAAA 
TGTGCGACAT 
CAAAATTGTG 
TACAGTGCTC 
GACGCAGCTT 
AGCTAATGTC 
AAAAGTTAGC 
TGCTCAATAC 
GCGCTACAAC 
CCAGCACCTG 
TAACAGAGAT 
TCTACCACAA 
CTTGGGCATG 
TTATGTGGCT 
TATAGTCC CA 
GGACGGACTG 
CATGATTCCT 
CGTGCTCCAC 
GCCTTCACAG 
CAGCCTCATC 
GTGTGCGCTT 
ATAGCTGAGC 
CTGCTGATTA 
TCAAAGACAT 



97 

GCTTATTCGC 
GGCAGCCAGG 
AG T CAT AAAA 
TACCATGTAC 
TGGAAAACAC 
GCTTTGGTCT 
TATTGAGGAA 
ACCACTAACT 
ATTATAATTA 
CATTCCCCTG 
CTTGAAGT CA 
TCCCGCGGAT 
GACCAACACA 
AT AC ACC CCA 
TGGTGGTTCT 
CATCTGCTGC 
TCATTGTGCT 
AAACACATGT 
CGAGTTTTTA 
ATTGGCTGCG 
TCTATTTGCT 
ACTGTGGTCA 
TGCATATCTC 
TTCTTAGAAT 
TTTGCACCCT 
ATATCATGCA 



CACAAAAACA 
TGACACTACA 
CTTTTATGTA 
ATGAGCAAAC 
TGGCACTTTC 
GTACCCTACT 
AAG AAAATG C 
GCTTTACTCG 
GAATAGGATT 
AACAATTGAC 
GGCTTCCTGG 
TTGTTCCAGT 
AC CAACGCGG 
AGTTTCTGCC 
CCATAGCGCT 
CTAAAGCGCA 
ACACCCAAAC 
TCTTTTCTCT 
TATTACTGAC 
gtttctcaca 

TTACGGATTT 

TCGCCTTTAT 

AGACACCATC 

TCTTTAATTA 

ATCTGCGTTT 

GATTCACTCG 



AAATTGGCAA 
GAGTATAATG 
TACTTTTCCA 
AGTATAAGTT 
TG CTGCACTG 
CTATATTAAA 
CTTAATTTAC 
CTGCTTGCAA 
TAAACCCCCC 
TCTATGTGGG 
ATGTCAGCAT 
CCAACTACAG 
CCGCCGCTAC 
TTTGTCAATA 
TATGTTTGTA 
AACGCGCCCG 
AATGATGGAA 
TACAGTATGA 
CCTTGTTGCG 
TCGAAGTAGA 
GTCACCCTCA 
CCAGTGCATT 
CCCAGTACAG 
TGAAATTTAC 
TGTTCCCCGA 
TATATGGAAT 



30250 

30300 

30350 

30400 

30450 

30500 

30550 

30600 

30650 

30700 

30750 

30800 

30850 

30900 

30950 

31000 

31050 

31100 

31150 

31200 

31250 

31300 

31350 

31400 

31450 



31500 
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ATTCCAAGTT 

TGCAATCATC 

TATATCCCTA 

CCAACTTTCC 

CGG CTTTGTC 

AAATCAGCTA 

AGAAATGGAC 

CAGCGGCCGA 

TTGCACCAGT 

CACCTACGAC 

CCAAGCGTCA 

ACTCAG C ACT 

ACCTGAGGAT 

TTATTCCCTT 

ATCAGTTAGC 

CCTCCCAGCT 

AATCTAAATG 

TATCTTCATG 

TCAACCCCGT 

TTTCTTACTC 

TGGGGTACTC 

TGCTTGCGCT 

CTTACCTCCC 

GTCAAACATA 



GCTACAATGA 

TCTGTTATGG 

CCTTGACATT 

CCGCGCCCGC 

CCAGCCAATC 

CTTTAATCTA 

GGAATTATTA 

GCAACAGCGC 

GCAAAAGGGG 

AGTAATACCA 

GAAATTGGTG 

CGGTAGAAAC 

CTCTGCACCC 

TAACTAATAA 
AAATTTCTGT 
CTGGTATTGC 
GAATGTCAGT 
TTGTTGCAGA 
GTATCCATAT 
CTCCCTTTGT 
TCTTT GCGCC 
CAAAATGGGC 
AAAATGTAAC 
AACCTGGAAA 



98 

AAAAAGCGAT 

TGTTCTGCAG 

GGCTGGAAAC 

TATGCTTCCA 

AGCCTCGCCC 

ACAGGAGGAG 

CAGAGCAGCG 

ATGAATCAAG 

TATCTTTTGT 

CCGGACACCG 

GTCATGGTGG 

CGAAGGCTGC 

TTATTAAGAC 

AAAAAAATAA 

CCAGTTTATT 

AGCTTCCTCC 

TTCCTCCTGT 

TGAAGCGCGC 

GACACGGAAA 

ATCCCCCAAT 

TATCCGAACC 

AACGGCCTCT 

CACTGTGAGC 

TATCTGCACC 



CTTTCCGAAG 

TACCATCTTA 

GAATAGATGC 

CTGCAACAAG 

CACTTCTCCC 

ATGACTGACA 

CCTGCTAGAA 

AGCTCCAAGA 

CTGGTAAAG C 

CCTTAGCTAC 

GAGAAAAGCC 

ATTCACTCAC 

CCTGTGCGGT 

TAAAGC AT CA 

CAGCAGCACC 

TGGCTGCAAA 

TCCTGTCCAT 

AAGACCGTCT 

CCGGTCCTCC 

GGGTTTCAAG 

TCTAGTTACC 

CTCTGGACGA 

CCACCTCTCA 

CCTCACAGTT 



CCTGGTTATA 

GCCCTAGCTA 

CATGAACCAC 

TTGTTGCCGG 

ACCCCCACTG 

CCCTAGATCT 

AGACGCAGGG 

CATGGTTAAC 

AGGCCAAAGT 

AAGTTGCCAA 

CATTACCATA 

CTTGTCAAGG 

CTCAAAGATC 

CTTACTTAAA 

TCCTTGCCCT 

CTTTCTCCAC 

CCGCACCCAC 

GAAGATACCT 

AACTGTGCCT 

AGAGTCCCCC 

TCCAATGGCA 

GGCCGGCAAC 

AAAAAACCAA 

ACCTCAGAAG 



31550 

31600 

31650 

31700 

31750 

31800 

31850 

31900 

31950 

32000 

32050 

32100 

32150 

32200 

32250 

32300 

32350 

32400 

32450 

32500 

32550 

32600 

32650 

32700 

32750 



CCCTAACTGT GGCTGCCGCC GCACCTCTAA TGGTCGCGGG CAACACACTC 
ACCATGCAAT CACAGGCCCC GCTAACCGTG CACGACTCCA AACTTAGCAT 



32800 
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TGCCACCCAA 


GGACCCCTCA 


CAGTGTCAGA 


AGGAAAGCTA 


GCCCTGCAAA 


32850 


CATCAGGCCC 


CCTCACCACC 


ACCGATAGCA 


GTACCCTTAC 


TATCACTGCC 


32900 


TCACCCCCTC 


TAACTACTGC 


CACTGGTAGC 


TTGGGCATTG 


ACTTGAAAGA 


32950 


GCCCATTTAT 


ACACAAAATG 


GAAAACTAGG 


ACTAAAGTAC 


GGGGCTCCTT 


33000 


TGCATGTAAC 


AGACGACCTA 


AACACTTTGA 


CCGTAGCAAC 


TGGTCCAGGT 


33050 


GTGACTATTA 


ATAATACTTC 


CTTGCAAACT 


AAAGTTACTG 


GAGCCTTGGG 


33100 


TTTTGATTCA 


CAAGGCAATA 


TGCAACTTAA 


TGTAGCAGGA 


GGACTAAGGA 


33150 


TTGATTCTCA 


AAACAGACGC 


CTTATACTTG 


ATGTTAGTTA 


TCCGTTTGAT 


33200 


GCTCAAAACC 


AACTAAATCT 


AAGACTAGGA 


CAGGGCCCTC 


TTTTTATAAA 


33250 


CTCAGCCCAC 


AACTTGGATA 


TTAACTACAA 


CAAAGGCCTT 


TACTTGTTTA 


33300 


CAGCTTCAAA 


CAATTCCAAA 


AAGCTTGAGG 


TTAACCTAAG 


CACTGCCAAG 


33350 


GGGTTGATGT 


TTGACGCTAC 


AGCCATAGCC 


ATTAATGCAG 


GAGATGGGCT 


33400 


TGAATTTGGT 


TCACCTAATG 


CACCAAACAC 


AAATCCCCTC 


AAAACAAAAA 


33450 


TTGGCCATGG 


CCTAGAATTT 


GATTCAAACA 


AGGCTATGGT 


TCCTAAACTA 


33500 


GGAACTGGCC 


TTAGTTTTGA 


CAGCACAGGT 


GCCATTACAG 


TAGGAAACAA 


33550 


AAATAATGAT 


AAGCTAACTT 


TGTGGACCAC 


ACCAGCTCCA 


TCTCCTAACT 


33600 


GTAGACTAAA 


TGCAGAGAAA 


GATGCTAAAC 


TCACTTTGGT 


CTTAACAAAA 


33650 


TGTGGCAGTC 


AAATACTTGC 


TACAGTTTCA 


GTTTTGGGTG 


TTAAAGGCAG 


33700 


TTTGGCTCCA 


ATATCTGGAA 


CAGTTCAAAG 


TGCTCATCTT 


ATTATAAGAT 


33750 


TTGACGAAAA 


TGGAGTGCTA 


CTAAACAATT 


CCTTCCTGGA 


CCCAGAATAT 


33800 


TGGAACTTTA 


GAAATGGAGA 


TCTTACTGAA 


GGCACAGCCT 


ATACAAACGC 


33850 


TGTTGGATTT 


ATGCCTAACC 


TATCAGCTTA 


TCCAAAATCT 


CACGGTAAAA 


33900 


CTGCCAAAAG 


TAACATTGTC 


AGTCAAGTTT 


ACTTAAACGG 


AGACAAAACT 


33950 


AAACCTGTAA 


CACTAACCAT 


TACACTAAAC 


GGTACACAGG 


AAACAGGAGA 


34000 


CACAACTCCA 


AGTGCATACT 


CTATGTCATT 


TTCATGGGAC 


TGGTCTGGCC 


34050 


ACAACTACAT 


TAATGAAATA 


TTTGCCACAT 


CCTCTTACAC 


TTTTTCATAC 


34100 
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ATTGCCCAAG 


AATAAAGAAT 


CGTTTGTGTT 


ATGTTTCAAC 


GTGTTTATTT 


34150 


TTCAATTGCA 


GAAAATTTCA 


AGTCATTTTT 


CATTCAGTAG 


TATAGCCCCA 


34200 


CCACCACATA 


GCTTATACAG 


ATCAC CGTAC 


CTTAATCAAA 


CT CACAGAAC 


34250 


CCTAGTATTC 


AACCTGCCAC 


CTCCCTCCCA 


ACACACAGAG 


TACACAGTCC 


34300 


TTTCTCCCCG 


GCTGGCCTTA 


AAAAGCATCA 


TATCATGGGT 


AACAGACATA 


34350 


TTCTTAGGTG 


TTATATTCCA 


CACGGTTTCC 


TGTCGAGCCA 


AACGCTCATC 


34400 


AGTGATATTA 


ATAAACT GGC 


GGCG ATATAA 


AATGCAAGGT 


GCTGCTCAAA 


34450 


AAATCAGGCA 


AAGCCTCGCG 


CAAAAAAGAA 


AGCACATCGT 


AGTCATGCTC 


34500 


ATGCAGATAA 


AGG CAGGTAA 


GCTCCGGAAC 


CACCACAGAA 


AAAGACACCA 


34550 


TTTTTCTCTC 


AAACATGTCT 


GCGGGTTTCT 


GC ATAAACAC 


AAAATAAAAT 


34600 


AACAAAAAAA 


CATTTAAACA 


TTAGAAGCCT 


GTCTTACAAC 


AGGAAAAACA 


34650 


ACCCTTATAA 


GCATAAGACG 


GACTACGGCC 


ATGCCGGCGT 


GACCGTAAAA 


34700 


AAACTGGTCA 


CCGTGATTAA 


AAAGCACCAC 


CGACAGCTCC 


TCGGTCATGT 


34750 


CCGGAGTCAT 


AATGTAAGAC 


TCGGTAAACA 


CATCAGGTTG 


ATTCATCGGT 


34800 


CAGTGCTAAA 


AAGCGACCGA 


AATAGCCCGG 


GGGAATACAT 


ACCCGCAGGC 


34850 


GTAGAGACAA 


CATTACAG C C 


CCCATAGGAG 


GTATAACAAA 


ATTAATAGGA 


34900 


GAGAAAAACA 


CATAAACACC 


TGAAAAAC CC 


TCCTGCCTAG 


GCAAAATAGC 


34950 


ACCCTCCCGC 


TCCAGAACAA 


CATACAGCGC 


TTCACAGCGG 


CAGCCTAACA 


35000 


GTCAGCCTTA 


CCAGTAAAAA 


AGAAAACCTA 


TTAAAAAAAC 


ACCACTCGAC 


35050 


ACGGCACCAG 


CTCAATCAGT 


CACAGTGTAA 


AAAAGGGCCA 


AGTG CAGAG C 


35100 


GAGTATATAT 


AGGACTAAAA 


AATGACGTAA 


CGGTTAAAGT 


CCACAAAAAA 


35150 


CACCCAGAAA 


ACCGCACGCG 


AACCTACGCC 


CAGAAACGAA 


AGCCAAAAAA 


35200 


CCCACAACTT 


CCTCAAATCG 


TCACTTCCGT 


TTTCCCACGT 


TACGTAACTT 


35250 


CCCATTTTAA 


GAAAACTACA 


ATTCCCAACA 


CATACAAGTT 


ACTCCGCCCT 


35300 


AAAACCTACG 


TCACCCGCCC 


CGTTCCCACG 


CCCCGCGCCA 


CGTCACAAAC 


35350 
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TCCACCCCCT CATTATCATA TTGGCTTCAA TCCAAAATAA GGTATATTAT 35400 
TGATGATG 35408 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8509 base pairs 
• - - (B) TYPE: nucleic acid 

(C) STRANDEDNESS! double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GCCCAATACG 


CAAACCGCCT 


CTCCCCGCGC 


AGCTGCGCGC 


TCGCTCGCTC 


ACTGAGGCCG 


CGGGCGACCT 


TTGGTCGCCC 


GGCCTCAGTG 


GGGAGTGGCC 


AACTCCATCA 


CTAGGGGTTC 


CGCCATGCTA 


CTTATCTACG 


TAGCCATTCT 


CATAACTTAC 


GGTAAATGG C 


CCGCCTGGCT 


CCATTGACGT 


CAATAATGAC 


GTATGTTCCC 


TTTCCATTGA 


CGTCAATGGG 


TGGAGTATTT 


CAGTACATCA 


AG TGTATCAT 


ATG C CAAGT A 


GACGGTAAAT 


GGCCCGCCTG 


GCATTATGCC 


CTTTCCTACT 


TGGCAGTACA 


TCTACGTATT 


TGATGCGGTT 


TTGGCAGTAC 


ATCAATGGGC 


CGGGGATTTC 


CAAGTCTCCA 


CCCCATTGAC 


GCACCAAAAT 


CAACGGGACT 


TTCCAAAATG 


TGACGCAAAT 


GGGCGGTAGG 


CGTGTACGGT 


GCTCGTTTAG 


TGAACCGTCA 


GATCGCCTGG 


TGACCTCCAT 


AGAAGACACC 


GGGACCGATC 


ATCCGGTACT 


CGAGGAACTG 


AAAAACCAGA 



GTTGGCCGAT TCATTAATGC 50 
CCCGGGCAAA GCCCGGGCGT 100 
AGCGAGCGAG CGCGCAGAGA 150 
CTTGTAGTTA ATGATTAACC 200 
CTAGCCCCTG CAGGTCGTTA 250 
GACCGCCCAA CGACCCCCGC 300 
ATAGTAACGC CAATAGGGAC 350 
ACGGTAAACT GCCCACTTGG 400 
CGCCCCCTAT TGACGTCAAT 450 
CAGTACATGA CCTTATGGGA 500 
AGTCATCGCT ATT AC CATGG 550 
GTGGATAGCG GTTTGACTCA 600 
GTCAATGGGA GTTTGTTTTG 650 
TCGTAACAAC TCCGCCCCAT 700 
GGGAGGTCTA TATAAGCAGA 750 
AGACGCCATC CACGCTGTTT 800 
CAGCCTCCGG ACTCTAGAGG 850 
AAGTTAACTG GTAAGTTTAG 



900 
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TCTTTTTGTC TTTTATTTCA GGTCCCGGAT CCGGTGGTGG TGCAAATCAA 950 
AGAACTGCTC CTCAGTGGAT GTTGCCTTTA CTTCTAGGCC TGTACGGAAG 1000 
TGTTACTTCT GCT CTAAAAG CTGCGGAATT GTACCCGCGG CCGCAATTCC 1050 
CGGGGATCGA AAGAGCCTGC TAAAGCAAAA AAGAAGTCAC CATGTCGTTT 1100 
ACTTTGACCA ACAAGAACGT GATTTTCGTT GCCGGTCTGG GAGGCATTGG 1150 
TCTGGACACC AGCAAGGAGC TGCTCAAGCG CGATCCCGTC GTTTTACAAC 1200 
GTCGTGACTG GGAAAACCCT GGCGTTACCC AACTTAATCG CCTTGCAGCA 1250 
CATCCCCCTT TCGCCAGCTG GCGTAATAGC GAAGAGGCCC GCACCGATCG 1300 
CCCTTCCCAA CAGTTGCGCA GCCTGAATGG CGAATGGCGC TTTGCCTGGT 1350 
TTCCGGCACC AGAAGCGGTG CCGGAAAGCT GGCTGGAGTG CGATCTTCCT 1400 
GAGGCCGATA CTGTCGTCGT CCCCTCAAAC TGGCAGATGC ACGGTTACGA 1450 
TGCGCCCATC TACACCAACG TAACCTATCC CATTACGGTC AATCCGCCGT 1500 
TTGTTCCCAC GGAGAATCCG ACGGGTTGTT ACTCGCTCAC ATTTAATGTT 1550 
GATGAAAGCT GGCTACAGGA AGGCCAGACG CGAATTATTT TTGATGGCGT 1600 
TAACTCGGCG TTTCATCTGT GGTGCAACGG GCGCTGGGTC GGTTACGGCC 1650 
AGGACAGTCG TTTGCCGTCT GAATTTGACC TGAGCGCATT TTTACGCGCC 1700 
GGAGAAAACC GCCTCGCGGT GATGGTGCTG CGTTGGAGTG ACGGCAGTTA 1750 
TCTGGAAGAT CAGGATATGT GGCGGATGAG CGGCATTTTC CGTGACGTCT 1800 
CGTTGCTGCA TAAACCGACT ACACAAATCA GCGATTTCCA TGTTGCCACT 1850 
CGCTTTAATG ATGATTTCAG CCGCGCTGTA CTGGAGGCTG AAGTTCAGAT 1900 
GTGCGGCGAG TTG CGTGACT ACCTACGGGT AACAGTTTCT TTATGGCAGG 1950 
GTGAAACGCA GGTCGCCAGC GGCACCGCGC CTTTCGGCGG TGAAATTATC 2000 
GATGAGCGTG GTGGTTATGC CGATCGCGTC ACACTACGTC TGAACGTCGA 2050 
AAACCCGAAA CTGTGGAGCG CCGAAATCCC GAATCTCTAT CGTGCGGTGG 2100 
TTGAACTGCA CACCGCCGAC GGCACGCTGA TTGAAGCAGA AGCCTGCGAT 2150 
GTCGGTTTCC G CGAGGTGCG GATTGAAAAT GGTCTGCTGC TGCTGAACGG 2200 
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CAAGCCGTTG 


CTGATTCGAG 


GCGTTAACCG 


TCACGAGCAT 


CATCCTCTGC 


2250 


ATGGTCAGGT 


CATGGATGAG 


CAGACGATGG 


TGCAGGATAT 


C CTGCTGATG 


2300 


AAGCAGAACA 


ACTTTAACGC 


CGTGCGCTGT 


TCGCATTATC 


CGAACCATCC 


2350 


GCTGTGGTAC 


ACGCTGTGCG 


ACCGCTACGG 


CCTGTATGTG 


GTGGATGAAG 


2400 


CCAATATTGA 


AACCCACGGC 


ATGGTGCCAA 


TGAATCGTCT 


GACCGATGAT 


2450 


CCGCGCTGGC 


TACCGGCGAT 


GAGCGAACGC 


GTAACGCGAA 


TGGTGCAGCG 


2500 


CGATCGTAAT 


CACCCGAGTG 


TGATCATCTG 


GTCGCTGGGG 


AATGAATCAG 


2550 


GCCACGGCGC 


TAATCACGAC 


GCGCTGTATC 


GCTGGATCAA 


ATCTGTCGAT 


2600 


CCTTCCCGCC 


CGGTGCAGTA 


TGAAGGCGGC 


GGAGCCGACA 


CCACGGCCAC 


2650 


CGATATTATT 


TGCCCGATGT 


ACGCGCGCGT 


GGATGAAGAC 


CAGCCCTTCC 


2700 


CGGCTGTGCC 


GAAATGGTCC 


ATCAAAAAAT 


GGCTTTCGCT 


ACCTGGAGAG 


2750 


ACGCGCCCGC 


TGATCCTTTG 


CGAATACGCC 


CACGCGATGG 


GTAACAGTCT 


2800 


TGGCGGTTTC 


GCTAAATACT 


GGCAGGCGTT 


TCGTCAGTAT 


CCCCGTTTAC 


2850 


AGGGCGGCTT 


CGTCTGGGAC 


TGGGTGGATC 


AGTCGCTGAT 


TAAATATGAT 


2900 


GAAAACGGCA 


ACCCGTGGTC 


GGCTTACGGC 


GGTGATTTTG 


GCGATACGCC 


2950 


GAACGATCGC 


CAGTTCTGTA 


TGAACGGTCT 


GGTCTTTGCC 


GACCGCACGC 


3000 


CGCATCCAGC 


GCTGACGGAA 


GO AAAACAC C 


AGCAGCAGTT 


TTTCCAGTTC 


3050 


CGTTTAT CCG 


GGCAAACCAT 


CGAAGTGACC 


AGCGAATACC 


TGTTCCGTCA 


3100 


TAGCGATAAC 


GAGCTCCTGC 


ACTGGATGGT 


GGCGCTGGAT 


GGTAAGCCGC 


3150 


TGGCAAGCGG 


TGAAGTGCCT 


CTGGATGTCG 


CTCCACAAGG 


TAAACAGTTG 


3200 


ATTGAACTG C 


CTGAACTACC 


GCAGCCGGAG 


A GCGCCGGGC 


AACTCTGGCT 


3250 


CACAGTACGC 


GTAGTGCAAC 


CGAACGCGAC 


CG CATGGT C A 


GAAGCCGGGC 


3300 


ACATCAGCGC 


CTGGCAGCAG 


TGGCGTCTGG 


CGGAAAACCT 


CAGTGTGACG 


3350 


CTCCCCGCCG 


CGTCCCACGC 


CATCCCGCAT 


CTGACCACCA 


GCGAAATGGA 


3400 


TTTTTGCATC 


GAGCTGGGTA 


ATAAGCGTTG 


GCAATTTAAC 


CGCCAGTCAG 


3450 


GCTTTCTTTC 


ACAGATGTGG 


ATTGGCGATA 


AAAAACAACT 


GCTGACGCCG 


3500 
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CTGCGCGATC AGTTCACCCG TGCACCGCTG GATAACGACA TTGGCGTAAG 3550 

TGAAGCGACC CGCATTGACC CTAACGCCTG GGTCGAACGC TGGAAGGCGG 3600 

CGGGCCATTA CCAGGCCGAA GCAGCGTTGT TGCAGTGCAC GGCAGATACA 3650 

CTTGCTGATG CGGTGCTGAT TACGACCGCT CACGCGTGGC AGCATCAGGG 3700 

GAAAACCTTA TTTATCAGCC GGAAAACCTA CCGGATTGAT GGTAGTGGTC 3750 

AAATGGCGAT TACCGTTGAT GTTGAAGTGG CGAGCGATAC ACCGCATCCG 3800 

GCGCGGATTG GCCTGAACTG CCAGCTGGCG CAGGTAGCAG AGCGGGTAAA 3850 

CTGGCTCGGA TTAGGGCCGC AAGAAAACTA TCCCGACCGC CTTACTGCCG 3900 

CCTGTTTTGA CCGCTGGGAT CTGCCATTGT CAGACATGTA TACCCCGTAC 3950 

GTCTTCCCGA GCGAAAACGG TCTGCG CTGC GGGACGCGCG AATTGAATTA 4000 

TGGCCCACAC CAGTGGCGCG GCGACTTCCA GTTCAACATC AGCCGCTACA 4050 

GTCAACAGCA ACTGATGGAA ACCAGCCATC GCCATCTGCT GCACGCGGAA 4100 

GAAGGCACAT GGCTGAATAT CGACGGTTTC CATATGGGGA TTGGTGGCGA 4150 

CGACTCCTGG AGCCCGTCAG TATCGGCGGA ATTACAGCTG AGCGCCGGTC 4200 

GCTACCATTA CCAGTTGGTC TGGTGTCAAA AATAATAATA ACCGGGCAGG 4250 

CCATGTCTGC CCGTATTTCG CGTAAGGAAA TCCATTATGT ACTATTTAAA 4300 

AAACACAAAC TTTTGGATGT TCGGTTTATT CTTTTTCTTT TACTTTTTTA 4350 

TCATGGGAGC CTACTTCCCG TTTTTCCCGA TTTGGCTACA TGACATCAAC 4400 

CATATCAGCA AAAGTGATAC GGGTATTATT TTTGCCGCTA TTTCTCTGTT 4450 

CTCGCTATTA TTCCAACCGC TGTTTGGTCT GCTTTCTGAC AAACTCGGCC 4500 

TCGACTCTAG GCGGCCGCGG GGATCCAGAC ATGATAAGAT ACATTGATGA 4550 

GTTTGGACAA ACCACAACTA GAATGCAGTG AAAAAAATGC TTTATTTGTG 4600 

AAATTTGTGA TGCTATTGCT TTATTTGTAA CCATTATAAG CTGCAATAAA 4650 

CAAGTTAACA ACAACAATTG CATTCATTTT ATGTTTCAGG TTCAGGGGGA 4700 

GGTGTGGGAG GTTTTTTCGG ATCCTCTAGA GTCGACCTGC AGGGGCTAGA 4750 

ATGGCTACGT AGATAAGTAG CATGGCGGGT TAATCATTAA CTACAAGGAA 4800 
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CCCCTAGTGA TGGAGTTGGC CACTCCCTCT CTGCGCGCTC GCTCGCTCAC 
TGAGGCCGGG CGACCAAAGG TCGCCCGACG CCCGGGC TTT GCCCGGGCGG 
CCTGAGTGAG CGAGCGAGCG CGCAGCTGGC GTAATAGCGA AGAGGCCCGC 
ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGAATTC 
CAGACGATTG AGCGTCAAAA TGTAGGTATT TCCATGAGCG TTTTTCCTGT 



TGCAATGGCT 
GTTTGAGTTC 
ATTGCGACAA 
CCTCACTGAT 
CTAAAATC C C 
GAGGAAAGCA 
GTAGCGGCGC 
GCTACACTTG 
CTTTCTCGCC 
TCCCTTTAGG 
CTTGATTAGG 
TTTTCGCCCT 
T C CAAACTGG 
TAAGGGATTT 
ACAAAAATTT 
TATTTGCTTA 
GGGGTACATA 
TCTTGTTTGC 
CCTCTCAAAA 
TTGAATATCA 
TTTGAATCTT 



GGCGGTAATA 

TTCTACTCAG 

CGGTTAATTT 

TATAAAAACA 

TTTAATCGGC 

CGTTATACGT 

ATTAAGCGCG 

CCAGCGCCCT 

ACGTTCGCCG 

GTTCCGATTT 

GTGATGGTTC 

TTGACGTTGG 

AACAACACTC 

TGCCGATTTC 

AACGCGAATT 

TACAATCTTC 

TGATTGACAT 

TCCAGACTCT 

ATAGCTACCC 

TATTGATGGT 

TACCTACACA 



TTGTTCTGGA 
GCAAGTGATG 
GCGTGATGGA 
CTT CTCAGGA 
CTCCTGTTTA 
GCTCGTCAAA 
GCGGGTGTGG 
AGCGCCCGCT 
GCTTTCCCCG 
AGTGCTTTAC 
ACGTAGTGGG 
AGTCCACGTT 
AACCCTATCT 
GGCCTATTGG 
TTAACAAAAT 
CTGTTTTTGG 
GCTAGTTTTA 
CAGG CAATGA 
TCTCCGGCAT 
GATTTGACTG 
TTACTCAGGC 



TATTACCAGC 
TTATTACTAA 
CAGACTCTTT 
TTCTGGCGTA 
GCTCCCGCTC 
GCAACCATAG 
TGGTTACGCG 
CCTTTCGCTT 
TCAAGCTCTA 
GGCACCTCGA 
CCATCGCCCT 
CTTTAATAGT 
CGGTCTATTC 
TTAAAAAATG 
ATTAACGTTT 
GGCTTTTCTG 
CGATTACCGT 
CCTGATAGCC 
GAATTT AT CA 
TCTCCGGCCT 
ATTGCATTTA 



AAGGCCGATA 

TCAAAGAAGT 

TACTCGGTGG 

CCGTTCCTGT 

TGATTCTAAC 

TACGCGCCCT 

CAGCGTGACC 

TCTTCCCTTC 

AATCGGGGGC 

CCCGAAAAAA 

GATAGACGGT 

GGACTCTTGT 

TTTTGATTTA 

AGCTGATTTA 

ACAATTTAAA 

ATTATCAACC 

TCATCGATTC 

TTTGTAGAGA 

GCTAGAACGG 

TTCTCACCCG 

AAATATATGA 



4850 

4900 

4950 

5000 

5050 

5100 

5150 

5200 

5250 

5300 

5350 

5400 

5450 

5500 

5550 

5600 

5650 

5700 

5750 

5800 

5850 

5900 

5950 

6000 

6050 



6100 




WO 96/39530 



PCT/US96/1 0245 



106 



GGGTTCTAAA 


AATTTTTATC 


CTTGCGTTGA 


AATAAAGGCT 


TCTCCCGCAA 


6150 


AAGTATTACA 


GGGTCATAAT 


GTTTTTGGTA 


CAACCGATTT 


AGCTTTATGC 


6200 


TCTGAGGCTT 


TATTGCTTAA 


TTTTGCTAAT 


TCTTTGCCTT 


GCCTGTATGA 


6250 


TTTATTGGAT 


GTTGGAATTC 


CTGATGCGGT 


ATTTTCTCCT 


TACGCATCTG 


6300 


TGCGGTATTT 


CACACCGCAT 


ATGGTGCACT 


CTCAGTACAA 


TCTGCTCTGA 


6350 


TGCCGCATAG 


TTAAGCCAGC 


CCCGACACCC 


GCCAACACCC 


GCTGACGCGC 


6400 


CCTGACGGGC 


TTGTCTGCTC 


CCGGCATCCG 


CTTACAGACA 


AGCTGTGACC 


6450 


GTCTCCGGGA 


GCTGCATGTG 


TCAGAGGTTT 


TCACCGTCAT 


CACCGAAACG 


6500 


CGCGAGACGA 


AAGGGCCTCG 


TGATACGCCT 


ATTTTTATAG 


GTTAATGTCA 


6550 


TGATAATAAT 


GGTTTCTTAG 


ACGTCAGGTG 


GCACTTTTCG 


GGGAAATGTG 


6600 


CGCGGAACCC 


CTATTTGTTT 


ATTTTTCTAA 


ATACATTCAA 


ATATGTATCC 


6650 


GCTCATGAGA 


CAATAACCCT 


GATAAATGCT 


TCAATAATAT 


TGAAAAAGGA 


6700 


AGAGTATGAG 


TATTCAACAT 


TTCCGTGTCG 


CCCTTATTCC 


CTTTTTTGCG 


6750 


GCATTTTGCC 


TTCCTGTTTT 


TG CTCACCCA 


GAAACGCTGG 


TGAAAGTAAA 


6800 


AGATGCTGAA 


GATCAGTT GG 


GTGCACGAGT 


GGGTTACATC 


GAACTGGATC 


6850 


TCAACAGCGG 


TAAGATCCTT 


GAGAGTTTTC 


GCCCCGAAGA 


ACGTTTTCCA 


6900 


ATGATGAGCA 


CTTTTAAAGT 


TCTGCTATGT 


GGCGCGGTAT 


TATCCCGTAT 


6950 


TGACGCCGGG 


CAAGAGCAAC 


TCGGTCGCCG 


CATACACTAT 


TCTCAGAATG 


7000 


ACTTGGTTGA 


GTACTCACCA 


GTCACAGAAA 


AGCATCTTAC 


GGATGGCATG 


7050 


ACAGTAAGAG 


AATTATGCAG 


TGCTGCCATA 


ACCATGAGTG 


ATAACACTGC 


7100 


GGCCAACTTA 


CTTCTGACAA 


CGATCGGAGG 


ACCGAAGGAG 


CTAACCGCTT 


7150 


TTTTGCACAA 


CATGGGGGAT 


CATGTAACTC 


GCCTTGATCG 


TTGGGAACCG 


7200 


GAGCTGAATG 


AAGCCATACC 


AAACGACGAG 


CGTGACACCA 


CGATGCCTGT 


7250 


AGCAATGGCA 


ACAACGTT GC 


GCAAACTATT 


AACTGGCGAA 


CTACTTACTC 


7300 


TAGCTTCCCG 


GCAACAATTA 


ATAGACTGGA 


TGGAGGCGGA 


TAAAGTTGCA 


7350 


GGACCACTTC 


TGCGCTCGGC 


CCTTCCGGCT 


GGCTGGTTTA 


TTGCTGATAA 


7400 
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ATCTGGAGCC 


GGTGAGCGTG 


GGTCTCGCGG 


TATCATTGCA 


GCACTGGGGC 


7450 


CAGATGGTAA 


GCCCTCCCGT 


ATCGTAGTTA 


TCTACACGAC 


GGGGAGTCAG 


7500 


GCAACTATGG 


ATGAACGAAA 


TAGACAG AT C 


GCTGAGATAG 


GTGCCTCACT 


7550 


GATTAAGCAT 


TGGTAACTGT 


CAGACCAAGT 


TTACTCATAT 


ATACTTTAGA 


7600 


TTGATTTAAA 


ACTTCATTTT 


TAATTTAAAA 


GGATCTAGGT 


GAAGAT CCTT 


7650 


TTTGATAATC 


TCATGACCAA 


AATCCCTTAA 


CGTGAGTTTT 


CGTTCCACTG 


7700 


AGCGTCAGAC 


CCCGTAGAAA 


AGATCAAAGG 


atcttcttga 


GATCCTTTTT 


7750 


TTCTGCGCGT 


AATCTGCTGC 


TTGCAAACAA 


AAAAACCACC 


GCTAC GAGCG 


7800 


GTGGTTTGTT 


TGCCGGATCA 


AGAGCTACCA 


ACTCTTTTTC 


CGAAGGTAAC 


7850 


TGGCTTCAGC 


AGAGCGCAGA 


TACCAAATAC 


TGTCCTTCTA 


GTGTAGCCGT 


7900 


AGTTAGGCCA 


CCACTTCAAG 


AACTCTGTAG 


CACCGCCTAC 


ATACCTCGCT 


7950 


CTGCTAATCC 


TGTTACCAGT 


GGCTGCTGCC 


AGTGGCGATA 


AGTCGTGTCT 


8000 


TACGGGGTTG 


GACTCAAGAC 


GATAGTTACC 


GGATAAGGCG 


CAGCGGTCGG 


8050 


GCTGAACGGG 


GGGTTCGTGC 


ACACAGCCCA 


GCTTGGAGCG 


AACGACCTAC 


8100 


ACCGAACTGA 


GATACCTACA 


GCGTGAGCTA 


TGAGAAAGCG 


CCACGCTTCC 


8150 


CGAAGGGAGA 


AAGGCGGACA 


GGTATCCGGT 


AAGCGGCAGG 


GTCGGAACAG 


8200 


GAGAGCG CAC 


GAGGGAGCTT 


CCAGGGGGAA 


ACGCCTGGTA 


TCTTTATAGT 


8250 


CCTGTCGGGT 


TTCGCCACCT 


CTGACTTGAG 


CGTCGATTTT 


TGTGATGCTC 


8300 


GTCAGGGGGG 


CGGAGCCTAT 


GGAAAAACGC 


CAGCAACGCG 


GCCTTTTTAC 


8350 


GGTTCCTGGC 


CTTTTGCTGG 


CCTTTTGCTC 


ACATGTTCTT 


TC CTG CGTTA 


8400 


TCCCCTGATT 


CTGTGGATAA 


CCGTATTACC 


GCCTTTGAGT 


GAGCTGATAC 


8450 


CGCTCGCCGC 


AGCCGAACGA 


CCGAGCGCAG 


CGAGTCAGTG 


AGCGAGGAAG 


8500 



CGGAAGAGC 



8509 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCCCAATACG CAAACCGCCT CTCCCCGCGC GTTGGCCGAT TCATTAATG C 50 

AGCTGCGCGC TCGCTCGCTC ACTGAGGCCG CCCGGGCAAA GCCCGGGCGT 100 

CGGGCGACCT TTGGTCGCCC GGCCTCAGTG AGCGAGCGAG CGCGCAGAGA 150 

GGGAGTGGCC AACTCCATCA CTAGGGGTTC CTTGTAGTTA ATGATTAACC 200 

CGCCATGCTA CTTATCTACA TCATCGATGA ATTCGAGCTT GCATGCCTGC 250 

AGGTCGTTAC ATAACTTACG GTAAATGGCC CGCCTGGCTG ACCGCCCAAC 300 

GACCCCCGCC CATTGACGTC AATAATGACG TATGTTCCCA TAGTAACGCC 350 

AATAGGGACT TTC CATTGAC GTCAATGGGT GGAGTATTTA CGGTAAACTG 400 

CCCACTTGGC AGTACATCAA GTGTATCATA TGCCAAGTAC GCCCCCTATT 450 

GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC AGTACATGAC 500 

CTTATGGGAC TTTCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA 550 

TTACCATGGT GATGCGGTTT TGGCAGTACA TCAATGGGCG TGGATAGCGG 600 

TTTGACTCAC GGGGATTTCC AAGTCTCCAC CCCATTGACG TCAATGGGAG 650 

TTTGTTTTGG CACCAAAATC AACGGGACTT TCCAAAATGT CGTAACAACT 700 

CCGCCCCATT GACGCAAATG GGCGGTAGGC GTGTACGGTG GGAGGTCTAT 750 

ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA GACGCCATCC 800 

ACGCTGTTTT GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGGA 850 

CTCTAGAGGA TCCGGTACTC GACCCGAGCT CGGATCCACT AGTAACGGCC 900 

GCCAGTGTGC TGGAATTCTG CACTCCAGGC TGCCCGGGTT TGCATGCTGC 950 

TGCTGCTGCT GCTGCTGGGC CTGAGGCTAC AGCTCTCCCT GGGCATCATC 1000 
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CTAGTTGAGG 


AGGAGAACCC 


GGACTTCTGG 


AACCGCGAGG 


CAGCCGA GGC 


1050 


CCTGGGTGCC 


G C CAAGAAGC 


TGCAGCCTGC 


ACAGACAGCC 


GO CAAGAACC 


1100 


TCATCATCTT 


CCTGGGCGAT 


GGGATGGGGG 


TGTCTACGGT 


GACAGCTGCC 


1150 


AGGATCCTAA 


AAGGGCAGAA 


GAAGGACAAA 


CTGGGGCCTG 


AGATACCCCT 


1200 


GGCCATGGAC 


CGCTTCCCAT 


ATGTGGCTCT 


GTC CAAGACA 


TACAATGTA G 


1250 


ACAAACATGT 


GCCAGACAGT 


GGAGCCACAG 


CCACGGCCTA 


CCTGTGCGGG 


1300 


GTCAAGGGCA 


ACTTCCAGAC 


CATTGGCTTG 


AGTGCAGCCG 


CCCGCTTTAA 


1350 


CCAGTGCAAC 


ACGACACGCG 


GCAACGAGGT 


CATCTCCGTG 


ATGAATCGGG 


1400 


CCAAGAAAGC 


AGGGAAGTCA 


GTGGGAGTGG 


TAACCACCAC 


ACGAGTGCAG 


1450 


CACGCCTCGC 


CAGCCGGCAC 


CTACGCCCAC 


ACGGTGAACC 


GCAACTGGTA 


1500 


CTCGGACGCC 


GACGTGCCTG 


CCTCGGCCCG 


CCAGGAGGGG 


TGCCAGGACA 


1550 


TCGCTACGCA 


GCTCATCTCC 


AACATGGACA 


TTGATGTGAT 


C CTAGGTGGA 


1600 


GGCCGAAAGT 


ACATGTTTCG 


CATGGGAACC 


CCAGACCCTG 


AGTACCCAGA 


1650 


TGACTACAGC 


CAAGGTGGGA 


CCAGGCTGGA 


CGGGAAGAAT 


CTGGTGCAGG 


1700 


AATGGCTCGG 


CGAACGCCAG 


GGTGCCCGGT 


ACGTGTGGAA 


CCGCACTGAG 


1750 


CTCATGCAGG 


CTTCCCTGGA 


CCCGTCTGTG 


ACCCATCTCA 


TGGGTCTCTT 


1800 


TGAGCCTGGA 


GACATGAAAT 


ACGAGATCCA 


CCGAGACTCC 


ACACTGGACC 


1850 


CCTCCCTGAT 


GGAGATGACA 


GAGGCTGCCC 


TGCGCCTGCT 


GAGCAGACAC 


1900 


CCCCGCGGCT 


TCTTCCTCTT 


CGTGGAGGGT 


GGTCGCATCG 


ACCATGGTCA 


1950 


TCATGAAAGC 


AGGGCTTACC 


GGGCACTGAC 


TGAGACGATC 


ATGTTCGACG 


2000 


ACGCCATTGA 


GAGGGCGGGC 


CAGCTCACCA 


GCGAGGAGGA 


CACGCTGAGC 


2050 


CTCGTCACTG 


CCGACCACTC 


CCACGTCTTC 


TCCTTCGGAG 


GCTACCCCCT 


2100 


GCGAGGGAGC 


TCCTTCATCG 


GGCTGGCCGC 


TGGCAAGGCC 


CGGGACAGGA 


2150 


AGGCCTACAC 


GGTCCTCCTA 


TACGGAAACG 


GTCCAGGCTA 


TGTGCTCAAG 


2200 


GACGGCGCCC 


GGCCGGATGT 


TACCGAGAGC 


GAGAGCGGGA 


GCCCCGAGTA 


2250 


TCGGCAGCAG 


TCAGCAGTGC 


CCCTGGACGA 


AGAGACCCAC 


GCAGGCGAGG 


2300 
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ACGTGGCGGT 


GTTCGCGCGC 


GGCCCGCAGG 


CGCACCTGGT 


TCACGGCGTG 


2350 


CAGGAGCAGA 


CCTTCATAGC 


GCACGTCATG 


GCCTTCGCCG 


CCTGCCTGGA 


2400 


GCCCTACACC 


GCCTGCGACC 


TGGCGCCCCC 


CGCCGGCACC 


ACCGACGCCG 


2450 


CGCACCCGGG 


GCGGTCCGTG 


GTCCCCGCGT 


TGCTTCCTCT 


GCTGGCCGGG 


2500 


ACCCTGCTGC 


TGCTGGAGAC 


GGCCACTGCT 


CCCTGAGTGT 


CCCGTCCCTG 


2550 


GGGCTCCTGC 


TTCCCCATCC 


CGGAGTT CTC 


CTGCTCCCCA 


CCTCCTGTCG 


2600 


TCCTGCCTGG 


CCTCCAGCCC 


GAGTCGTCAT 


CCCCGGAGTC 


CCTATACAGA 


2650 


GGTCCTGCCA 


TGGAACCTTC 


CCCTCCCCGT 


GCGCTCTGGG 


GACTGAGCCC 


2700 


ATGACACCAA 


ACCTGCCCCT 


TGGCTGCTCT 


CGGACTCCCT 


ACCCCAACCC 


2750 


CAGGGACTGC 


AGGTTGTGCC 


CTGTGGCTGC 


CTGCACCCCA 


GGAAAGGAGG 


2800 


GGGCTCAGGC 


CATCCAGCCA 


CCACCTACAG 


CCGAGTGGGG 


TCGAGACAGA 


2850 


TGGTCAGTCT 


GGAGGATGAC 


GTGGCGTGAA 


GCTGGCCGCG 


GGGATCCAGA 


2900 


CATGATAAGA 


TACATTGATG 


AGTTTGGACA 


AACCACAACT 


AGAATGCAGT 


2950 


GAAAAAAATG 


CTTTATTTGT 


GAAATTTGTG 


ATGCTATTGC 


TTTATTTGTA 


3000 


ACCATTATAA 


GCTGCAATAA 


ACAAGTTAAC 


AACAACAATT 


gcattcattt 


3050 


TATGTTTCAG 


GTTCAGGGGG 


AGGTGTGGGA 


GGTTTTTTCG 


GATCCTCTAG 


3100 


AGTCGACTCT 


AGANNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3150 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3200 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3250 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3300 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


3350 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNGGATCCC 


3400 


CATGACTACG 


TCCGGCGTTC 


CATTTGGCAT 


GACACTACGA 


CCAACACGAT 


3450 


CTCGGTTGTC 


TCGGCGCACT 


CCGTACAGTA 


GGGATCGTCT 


ACCTCCTTTT 


3500 


GAGACAGAAA 


CCCGCGCTAC 


CATACTGGAG 


GATCATCCGC 


TGCTGCCCGA 


3550 


ATGTAACACT 


TTGACAATGC 


ACAACGTGAG 


TTACGTGCGA 


GGTCTTCCCT 


3600 
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GCAGTGTGGG 


ATTTACGCTG 


ATTCAGGAAT 


GGGTTGTTCC 


CTGGGATATG 


3650 


GTTCTAACGC 


GGGAGGAGCT 


TGTAATCCTG 


AGGAAGTGTA 


TGCACGTGTG 


3700 


CCTGTGTTGT 


GCCAACATTG 


ATATCATGAC 


GAGCATGATG 


ATCCATGGTT 


3750 


ACGAGTCCTG 


GGCTCTCCAC 


TGTCATTGTT 


CCAGTCCCGG 


TTCCCTGCAG 


3800 


TGTATAGCCG 


GCGGGCAGGT 


TTTGGCCAGC 


TGGTTTAGGA 


TGGTGGTGGA 


3850 


TGGCGCCATG 


TTTAATCAGA 


GGTTTATATG 


GTACCGGGAG 


GTGGTGAATT 


3900 


ACAACATGCC 


AAAAGAGGTA 


ATGTTTATGT 


CCAGCGTGTT 


TATGAGGGGT 


3950 


CGCCACTTAA 


TCTACCTGCG 


CTTGTGGTAT 


GATGGCCACG 


TGGGTTCTGT 


4000 


GGTCCCCGCC 


ATGAGCTTTG 


GATACAGCGC 


CTTGCACTGT 


GGGATTTTGA 


4050 


ACAATATTGT 


GGTGCTGTGC 


TGCAGTTACT 


GTGCTGATTT 


AAGTGAGATC 


4100 


AGGGTGCGCT 


GCTGTGCCCG 


GAGGAGAAGG 


CGCCTTATGC 


TGCGGGCGGT 


4150 


GCGAATCATC 


GCTGAGGAGA 


CCACTGCCAT 


GTTGTATTC C 


TGCAGGACGG 


4200 


A GCGGCGGCG 


GCAG CAGTTT 


ATTCGCGCGC 


TGCTGCAGCA 


CCACCGCCCT 


4250 


ATCCTGATGC 


ACGATTATGA 


CTCTACCCCC 


ATGTAGGGAT 


CCCCATCACT 


4300 


AGTGCGGCCG 


CGGGGATCCA 


GACATGATAA 


GATACATTGA 


TGAGTTTGGA 


4350 


CAAACCACAA 


CTAGAATGCA 


GTGAAAAAAA 


TGCTTTATTT 


GTGAAATTTG 


4400 


TGATGCTATT 


gctttatttg 


TAACCATTAT 


AAGCTGCAAT 


AAACAAGTTA 


4450 


ACAACAACAA 


TTGCATTCAT 


TTTATGTTTC 


AGGTTCAGGG 


GGAGGTGTGG 


4500 


GAGGTTTTTT 


CGGATCCTCT 


AGAGTCGACC 


TGCAGGCATG 


CAAG CTGTAG 


4550 


ATAAGTAGCA 


TGGCGGGTTA 


ATCATTAACT 


ACAAGGAACC 


CCTAGTGATG 


4600 


GAGTTGGC C A 


CTCCCTCTCT 


GCGCGCTCGC 


TCGCTCACTG 


AGGCCGGGCG 


4650 


ACCAAAGGTC 


GCCCGACGCC 


CGGGCTTTGC 


CCGGGCGGCC 


TCAGTGAGCG 


4700 


AGCGAGCGCG 


CAGCTGGCGT 


AATAGCGAAG 


AGGCCCGCAC 


CGATCGCCCT 


4750 


TCCCAACAGT 


TGCGCAGCCT 


GAATGGCGAA 


TGGAANTTCC 


AGACGATTGA 


4800 


GCGTCAAAAT 


GTAGGTATTT 


CCATGAGCGT 


TTTTCCTGTT 


GCAATGGCTG 


4850 


GCGGTAATAT 


TGTTCTGGAT 


ATTACCAGCA 


AGGCCGATAG 


TTTGAGTTCT 


4900 
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TCTACTCAGG CAAGTGATGT TATTACTAAT CAAAGAAGTA TTGCGACAAC 4950 

GGTTAATTTG CGTGATGGAC AGACTCTTTT ACTCGGTGGC CTCACTGATT 5000 

ATAAAAACAC TTCTCAGGAT TCTGGCGTAC CGTTCCTGTC TAAAATCCCT 5050 

TTAATCGGCC TCCTGTTTAG CTCCCGCTCT GATTCTAACG AGGAAAGCAC 5100 

GTTATACGTG CTCGTCAAAG CAACCATAGT ACGCGCCCTG TAGCGGCGCA 5150 

TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC 5200 

CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA 5250 

CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT CCCTTTAGGG 5300 

TTCCGATTTA GTGCTTTACG GCACCTCGAC CCCAAAAAAC TTGATTAGGG 5350 

TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT TTTCGCCCTT 5400 

TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT CCAAACTGGA 5450 

ACAACACTCA ACCCTATCTC GGTCTATTCT TTTGATTTAT AAGGGATTTT 5500 

GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA CAAAAATTTA 5550 

ACGCGAATTT TAACAAAATA TTAACGTTTA CAATTTAAAT ATTTG CTTAT 5600 

ACAATCTTCC TGTTTTTGGG GCTTTTCTGA TTATCAACCG GGGTACATAT 5650 

GATTGACATG CTAGTTTTAC GATTACCGTT CATCGATTCT CTTGTTTGCT 5700 

CCAGACTCTC AGGCAATGAC CTGATAGCCT TTGTAGAGAC CTCT CAAAAA 5750 

TAGCTACCCT CTCCGGCATG AATTTATCAG CTAGAACGGT TGAATATCAT 5800 

ATTGATGGTG ATTTGACTGT CTCCGGCCTT TCTCACCCGT TTGAATCTTT 5850 

ACCTACACAT TACTCAGGCA TTGCATTTAA AATATATGAG GGTTCTAAAA 5900 

ATTTTTATCC TTGCGTTGAA ATAAAGGCTT CTCCCGCAAA AGTATTAGAG 5950 

GGTCATAATG TTTTTGGTAC AACCGATTTA GCTTTATGCT CTGAGGCTTT 6000 

ATTGCTTAAT TTTGCTAATT CTTTGCCTTG CCTGTATGAT TTATTGGATG 6050 

TTGGAANTTC CTGATGCGGT ATTTTCTCCT TACGCATCTG TGCGGTATTT 6100 

CACACCGCAT ATGGTGCACT CTCAGTACAA TCTGCTCTGA TGCCGCATAG 6150 

TTAAGCCAGC CCCGACACCC GCCAACACCC GCTGACGCGC CCTGACGGGC 6200 




WO 96/39530 



PCT/US96/10245 



113 



TTGTCTGCTC 


CCGGCATCCG 


CTTACAGACA 


AGCTGTGACC 


GTCTCCGGGA 


6250 


GCTGCATGTG 


TCAGAGGTTT 


TCACCGTCAT 


CACCGAAACG 


CGCGAGACGA 


6300 


AAGGGCCTCG 


TGATACGCCT 


ATTTTTATAG 


gttaatgtca 


TGATAATAAT 


6350 


GGTTTCTTAG 


ACGTCAGGTG 


GCACTTTTCG 


GGGAAATGTG 


CGCGGAACCC 


6400 


CTATTTGTTT 


ATTTTTCTAA 


ATACATT CAA 


ATATGTATCC 


GCTCATGAGA 


6450 


CAATAAC CCT 


GATAAATGCT 


TCAATAATAT 


TGAAAAAGGA 


AGAGTATGAG 


6500 


TATTCAACAT 


TTCCGTGTCG 


CCCTTATTCC 


CTTTTTTGCG 


GCATTTTGCC 


6550 


TTCCTGTTTT 


TGCTCACCCA 


GAAACGCTGG 


TGAAAGTAAA 


AGATGCTGAA 


6600 


GATCAGTTGG 


GTGCACGAGT 


GGGTTACATC 


GAACTGGATC 


TCAACAGCGG 


6650 


TAAGATCCTT 


GAGAGTTTTC 


GCCCCGAAGA 


ACGTTTTCCA 


ATGATGAGCA 


6700 


CTTTTAAAGT 


T CTGCTATGT 


GGCGCGGTAT 


TATCCCGTAT 


TGACGCCGGG 


6750 


CAAGAGCAAC 


TCGGTCGCCG 


CATACACTAT 


T CTC AGAATG 


ACTTGGTTGA 


6800 


GTACTCACCA 


GTCACAGAAA 


AG CATCTTAC 


GGATGGCATG 


ACAGTAAGAG 


6850 


AATTATGCAG 


TGCTGCCATA 


ACCATGAGTG 


ATAACACTGC 


GGCCAACTTA 


6900 


CTTCTGACAA 


CGATCGGAGG 


ACCGAAGGAG 


CTAACCGCTT 


TTTTGCACAA 


6950 


CATGGGGGAT 


CATGTAACTC 


GCCTTGATCG 


TTGGGAACCG 


GAGCTGAATG 


7000 


AAGCCATACC 


AAACGACGAG 


CGTGACACCA 


CGATGCCTGT 


AGCAATGGCA 


7050 


ACAACGTTGC 


G CAAACTATT 


AACTGGCGAA 


CTACTTACTC 


TAGCTTCCCG 


7100 


GCAACAATTA 


ATAGACTGGA 


TGGAGGCGGA 


TAAAGTTGCA 


GGACCACTTC 


7150 


TGCGCTCGGC 


CCTTCCGGCT 


GGCTGGTTTA 


TTGCTGATAA 


ATCTGGAGCC 


7200 


GGTGAGCGTG 


GGTCTCGCGG 


TATCATTGCA 


GCACTGGGGC 


CAGATGGTAA 


7250 


GCCCTCCCGT 


ATCGTAGTTA 


TCTACACGAC 


GGGGAGTCAG 


GGAACTATGG 


7300 


ATGAACGAAA 


TAGACAGATC 


GCTGAGATAG 


GTGCCTCACT 


GATT AAG CAT 


7350 


TGGTAACTGT 


CAGACCAAGT 


TT ACT C ATAT 


ATACTTTAGA 


TTGATTTAAA 


7400 


ACTTCATTTT 


TAATTTAAAA 


GGATCTAGGT 


GAAGATCCTT 


TTTGATAATC 


7450 


TCATGACCAA 


AATCCCTTAA 


CGTGAGTTTT 


CGTTCCACTG 


AGCGTCAGAC 


7500 
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CCCGTAGAAA 


AGATCAAAGG 


ATCTTCTTGA 


GATC CTTTTT 


TTCTGCGCGT 


7550 


AATCTGCTGC 


TTGCAAACAA 


AAAAACCACC 


GCTACCAGCG 


GTGGTTTGTT 


7600 


TGCCGGATCA 


AGAGCTACCA 


ACTCTTTTTC 


CGAAGGTAAC 


TGGCTTCAGC 


7650 


AGAGCGCAGA 


TACCAAATAC 


TGTCCTTCTA 


GTGTAGCCGT 


AGTTAGGCCA 


7700 


CCACTTCAAG 


AACTCTGTAG 


CACCGCCTAC 


ATACCTCGCT 


CTGCTAATCC 


7750 


TGTTACCAGT 


GGCTGCTGCC 


AGTGGCGATA 


AGTCGTGTCT 


TACCGGGTTG 


7800 


GACTCAAGAC 


GATAGTTACC 


GGATAAGGCG 


CAGCGGTCGG 


GCTGAACGGG 


7850 


GGGTTCGTGC 


ACACAGCCCA 


GCTTGGAGCG 


AACGACCTAC 


ACCGAACTGA 


7900 


GATACCTACA 


GCGTGAGCTA 


TGAGAAAGCG 


CCACGCTTCC 


CGAAGGGAGA 


7950 


AAGGCGGACA 


GGTATCCGGT 


AAGCGGCAGG 


GTCGGAACAG 


GAGAG CGCAC 


8000 


GAGGGAGCTT 


CCAGGGGGAA 


ACG C C TGGTA 


TCTTTATAGT 


CCTGTCGGGT 


8050 


TTCGCCACCT 


CTGACTTGAG 


CGTCGATTTT 


TGTGATGCTC 


GTCAGGGGGG 


8100 


CGGAG CCTAT 


GGAAAAACGC 


CAGCAA CGCG 


GCCTTTTTAC 


GGTTCCTGGC 


8150 


CTTTTGCTGG 


CCTTTTGCTC 


ACATGTTCTT 


TC CTGCGTTA 


TCCCCTGATT 


8200 


CTGTGGATAA 


CCG TATTACC 


GCCTTTGAGT 


GAGCTGATAC 


CGCTCGCCGC 


8250 


AGCCGAACGA 


CCGAGCGCAG 


CGAGTCAGTG 


AGCGAGGAAG 


CGGAAGAGC 


8299 
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WHAT IS CLAIMED IS: 

1. A method for enhancing the efficiency of 

transduction of a recombinant AAV into a target cell ex 
vivo comprising the steps of providing a recombinant 
adeno-associated virus comprising: (a) the DNA of at 

least a portion of the genome of an adeno-associated 

-virus which port ion is capable - of transducing a selected 
gene into a target cell in the absence of cell division; 
and (b) a selected gene operatively linked to regulatory 
sequences directing its expression, said gene flanked by 
the DNA of (a) and capable of expression in the target 
cell; infecting said target cells with said recombinant 
adeno-associated virus; contacting said infected cells 
with an agent which facilitates the conversion of said ss 
recombinant virus to its double stranded form, wherein 
said conversion occurs in said target cell, resulting in 
enhanced transduction of said recombinant virus into said 
target cell. 

2 . The method according to claim 1 wherein said 
agent is a helper virus comprising a selected gene 
encoding a polypeptide which can enhance the conversion 
of said single-stranded recombinant virus to double- 
stranded recombinant virus or functional fragment 
thereof, said helper virus capable of transducing said 
selected gene into the target cell in the absence of cell 
division. 



3 . The method according to claim 2 wherein said 
contacting step comprises co-infecting said target cell 
with said helper virus. 

4. The method according to claim 3 wherein said 
helper virus is an adenovirus. 
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5- The method according to claim 2 wherein said 
selected gene is the adenovirus E4 gene or a functional 
fragment thereof. 

6. The method according to claim 2 wherein said 
adenovirus contains two selected genes or functional 

— fragments thereof, said genes being the adenovirus E4 
gene and El gene. 

7. The method according to claim 5 or 6 wherein 
said functional fragment of said E4 gene comprises the 
open reading frame 6 of E4. 

8. The method according to claim 1 wherein said 
recombinant adeno-associated virus further comprises a 
second selected gene operatively linked to regulatory 
sequences capable of directing expression of said second 
gene, said second gene capable of facilitating the 
conversion of said single stranded recombinant virus to 
its double stranded form upon expression, and capable of 
co-expression with said first selected gene in the target 
cell, whereby the second gene product is expressed in the 
target cell, wherein said conversion occurs in said 
target cell, resulting in enhanced transduction of said 
recombinant virus in said target cell. 

9. The method according to claim 8 wherein the 
regulatory sequences directing expression of said second 
gene comprise an inducible promoter, and wherein 
expression of said second gene occurs in the presence of 
an inducing agent. 
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10. A recombinant adeno-associated virus comprising 
(a) the DNA of at least a portion of the genome of an 
adeno-associated virus which portion is capable of 
transducing at least two selected genes or functional 
fragments thereof into a target cell in the absence of 
cell division; (b) a first selected gene operatively 

linked -to- regulatory-sequences directing its expression, 

(c) a second selected gene operatively linked to 
regulatory sequences capable of directing expression of 
said second gene, said second gene capable of 
facilitating the conversion of said single stranded 
recombinant virus to its double stranded form upon 
expression, said first and said second genes flanked by 
the DNA of (a) , said first and second gene capable of co- 
expression in the target cell. 

11. The recombinant virus according to claim 10 
wherein said second gene is selected from the group 
consisting of an adenovirus E4 gene, ORF6 of E4 and a 
functional fragment thereof. 

12 . The recombinant virus according to claim 10 
wherein said second gene is the adenovirus El gene or a 
functional fragment thereof. 

13 . The recombinant virus according to claim 10 
wherein the regulatory sequences directing expression of 
said second gene comprise an inducible promoter, and 
wherein expression of said second gene occurs in the 
presence of an inducing agent. 
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14 . The recombinant virus according to claim 10 
further comprising (d) an additional selected gene 
operatively linked to regulatory sequences capable of 
directing expression of said gene, said additional gene 
and said second gene capable of jointly facilitating the 
conversion of said single stranded recombinant virus to 

— ~ its double stranded- form- upon expression of both said 
second and additional genes, said first, second and 
additional genes flanked by the DNA of (a) , and capable 
of co-expression in the target cell. 

15. The recombinant virus according to claim 14 
wherein said second gene is the adenovirus E4 gene or a 
functional fragment thereof and said additional gene is 
the adenovirus El gene or a functional fragment thereof. 

16. The recombinant virus according to claim 15 
wherein said functional fragment of E4 is the ORF6 
sequence. 



17 . The recombinant virus according to claim 15 
wherein the regulatory sequences directing expression of 
said additional gene comprise an inducible promoter, and 
wherein expression of said additional gene occurs in the 
presence of an inducing agent. 

18. A method for enhancing the efficiency of 

transduction of a recombinant AAV into a target cell 
comprising the steps of: providing a recombinant adeno- 

associated virus of claim 10 through 16; infecting said 
target cells with said recombinant virus; and culturing 
said infected cells under conditions which enable 
expression of said selected genes in said target cell. 
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wherein said conversion occurs in said target cell, 
resulting in enhanced transduction of said recombinant 
virus in said target cell. 

19. A method for enhancing the efficiency of 
transduction of a recombinant AAV into a target cell 

“comprising the steps of: providing a recombinant adeno- 
associated virus of claim 17 ; infecting said target cells 
with said recombinant virus; and contacting said target 
cell with an inducing agent which induces the expression 
of said second or additional gene in said target cell, 
wherein said conversion occurs in said target cell, 
resulting in enhanced transduction of said recombinant 
virus in said target cell. 

20. A pharmaceutical composition comprising a 
recombinant adeno-associated virus and an agent which 
facilitates the conversion of said ss recombinant virus 
to its double stranded form in a target cell. 

21. The composition according to claim 2 0 wherein 
said virus is selected from the group consisting of the 
recombinant viruses of claims 10-16. 

22. The composition according to claim 20 wherein 
said virus is the recombinant virus of claim 17 and 
further comprising an inducing agent which induces the 
expression of said second or additional gene in said 
recombinant virus. 

23. A mammalian cell transduced with the 
recombinant adeno-associated viruses of claims 10-17. 
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